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1A 

METHODS OF DIAG NOSIS OF OVARIAN CANCER. COMPOSITIONS AND 
METHODS OF SCR EENING' FOR MODULATORS OF OVARIAN CANCER 

Field of the invention 

5 The present invention relates to the identification of nucleic acid and 

protein expression profiles and nucleic acids, products, and antibodies thereto 
that are involved in ovarian cancer; and to the use of such expression profiles and 
compositions in the diagnosis, prognosis and therapy of ovarian cancer The 
invention further relates to methods for identifying and using agents and/or 

10 targets that inhibit ovarian cancer. 

Background of the invention 

Ovarian cancer is the fourth most frequent cause of cancer death in 
females and in the United States, and accounts for approximately 13,000 deaths 

15 . annually. Furthermore, ovarian cancer remains the number one killer of women 
with gynaecological malignant hyperplasia and the incidence is rising in 
industrialized countries. The etiology of the neoplastic transformation remains 
unknown although there is epidemiological evidence for an association with 
disordered endocrine function. The incidence of ovarian carcinoma is higher in 

20 nulliparous females and in those with early menopause. 

Approximately 75% of women diagnosed with such cancers are already at 
an advanced stage (III and IV) of the disease at their initial diagnosis. During the 
past 20 years, neither diagnosis nor five year survival rates have greatly improved 
for these patients. This is substantially due. to the high percentage of high-stage 

25 initial detection of the disease. There is therefore a need to develop new markers 
that improve early diagnosis and thereby reduce the percentage of high-stage 
initial diagnoses. 

Summary of the invention 

30 The present invention is based on the identification of a number of genes 

whose expression is altered in individuals with ovarian cancer versus individuals 
who do not have ovarian cancer. The particular genes are identified in the 
Tables. 

Accordingly, the present invention provides in a first aspect, a method of 
35 detecting an ovarian cancer-associated transcript in a biological sample, the 
method comprising contacting the biological sample with a polynucleotide that 
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selectively hybridises to a sequence at least 80% identical to a sequence as 
shown in the Tables. 

Preferably the polynucleotide selectively hybridises to a sequence at least 
95% identical to a sequence as shown in the Tables. More preferably, the 
5 polynucleotide comprises a sequence as shown in the Tables. 

In one embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the 
polynucleotide. 

In a preferred embodiment, the polynucleotide is immobilised on a solid 
io surface. 

In a related aspect, the present invention provide? a method of detecting 
an ovarian cancer-associated polypeptide in a biological sample the method 
comprising contacting the biological sample with an antibody which binds 
specifically to the ovarian cancer-associated polypeptide in the biological sample, 
15 the polypeptide being encoded by a polynucleotide that selectively hybridises to a 
sequence at least 80% identical to a sequence as shown in the Tables. 

In a further related aspect, the present invention provides a method of 
detecting an ovarian cancer-associated antibody in a biological sample the 
method comprising contacting the biological sample with a polypeptide encoded 
20 by a polynucleotide that selectively hybridises to a sequence at least 80% 
identical to a sequence as shown in the Tables, wherein the polypeptide 
specifically binds to the ovarian cancer-associated antibody. 

Preferably, in the above methods, the biological sample is contacted with a 
plurality of the polynucleotides, polypeptides or antibodies referred to above. 
25 The biological sample is typically a tissue sample or a blood sample. 

In one embodiment, the biological sample is from a patient undergoing a 
therapeutic regimen to treat ovarian cancer. In an alternative embodiment, the 
biological sample is from a patient suspected of having ovarian cancer. 

In a second aspect, the present invention a method of monitoring the 
30 efficacy of a therapeutic treatment of ovarian cancer, the method comprising the 
steps of: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated transcript in the 
35 biological sample by contacting the biological sample with a polynucleotide 

that selectively hybridises to a sequence at least 80% identical to a 
sequence as shown in the Tables, thereby monitoring the efficacy of the 
therapy. 



Preferably the method further comprises the step of: (iii) comparing the 
level of the ovarian cancer-associated transcript- to a level of the ovarian cancer- 
associated transcript in a biological sample from the patient prior to, or earlier in, 
the therapeutic treatment. 

In a related aspect, the present invention provides a method of monitoring 
the efficacy of a therapeutic treatment of ovarian cancer, the method comprising 
the steps of: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated antibody in the 
biological sample by contacting the biological sample with a polypeptide 
encoded by a polynucleotide that selectively hybridises to a sequence at 
least 80% identical to a sequence as shown in the Tables, wherein the 
polypeptide specifically binds to the ovarian cancer-associated antibody, 
thereby monitoring the efficacy of the therapy. » 

Preferably the method further comprises the step of: (iii) comparing the 
level of the ovarian cancer-associated antibody to a level of the ovarian cancer- 
associated antibody in a biological sample from the patient prior to, or earlier in, 
the therapeutic treatment. 

In a further related aspect, the present invention provides a method of 
monitoring the efficacy of a therapeutic treatment of ovarian cancer, the method 
comprising the steps of: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated polypeptide in the 
biological sample by contacting the biological sample with an antibody, 
wherein the antibody specifically binds to a polypeptide encoded by a 
polynucleotide that selectively hybridises to a sequence at least 80% 
identical to a sequence as shown in the Tables, thereby monitoring the 
efficacy of the therapy. 

Preferably the method further comprises the step of: (iii) comparing the 
level of the ovarian cancer-associated polypeptide to a level of the ovarian 
cancer-associated polypeptide in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment. 

Preferably, in the above methods, the biological sample is contacted with a 
plurality of the polynucleotides, polypeptides or antibodies referred to above. 
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In a third aspect, the present invention provides an isolated polynucleotide 
selected from the group consisting of: 

(a) polynucleotides comprising a nucleotide sequence as shown in the 
Tables, or the complement thereof; 

(b) polynucleotides comprising a nucleotide sequence capable of 
selectively hybridising to a nucleotide sequence as shown in the Tables; 

(c) polynucleotides comprising a nucleotide sequence capable of 
selectively hybridising to the complement of a nucleotide sequence as 
shown in the Tables; and 

(d) polynucleotides comprising a polynucleotide sequence which is 
degenerate as a result of the genetic code to the polynucleotides defined in 
(a), (b)or(c). 

Preferred polynucleotides, comprise a polynucleotide sequence as shown in 
15 the Tables or a sequence having at least 80% homology thereto. 

The present invention also provides a nucleic acid vector comprising a 
polynucleotide of the invention. In one embodiment, the polynucleotide is operably 
linked to a regulatory control sequence capable of directing expression of the 
polynucleotide in a host cell. 

20 The present invention further provides a host cell comprising a vector of the 

invention. 

In a fourth aspect, the present invention provides an isolated polypeptide 
which is encoded by a polynucleotide of the invention. The present invention also 
provides an isolated polypeptide encoded by a polynucleotide that selectively 
25 hybridises to a sequence at least 80% identical to a sequence as shown in the 
Tables. 

In a fifth aspect the present invention provides an antibody that binds 
specifically a polypeptide of the invention. 

In a sixth aspect, the present invention provides a method for identifying a 
30 compound that modulates an ovarian cancer-associated polypeptide, the method 
comprising the steps of: 

(i) contacting the compound with a ovarian cancer-associated 
polypeptide, the polypeptide encoded by a polynucleotide that selectively 
hybridises to a sequence at least 80% identical to a sequence as shown in 

35 the Tables; and 

(ii) determining the functional effect of the compound upon the 
polypeptide. 
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The functional effect may, for example, be a physical effect or a chemical 

effect. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 
5 In a particular embodiment, the polypeptide is expressed in a eukaryotic 

host cell or cell membrane. 

Preferably the polypeptide is recombinant. 

In another aspect, the present invention provides a method of inhibiting 
proliferation of a ovarian tumour cell, which method comprises contacting said cell 
10 with a compound identified using the method of the sixth aspect of the invention. 

In a further aspect, the present invention provides a method of inhibiting 
proliferation of a ovarian cancer-associated cell to treat ovarian cancer in a 
patient, the method comprising the step of administering to the patient a 
therapeutically effective amount of a compound identified using the method of the 
15 sixth aspect of the invention. 

In a seventh aspect, the present invention provides a drug screening assay 
comprising the steps of: 

(i) administering a test compound to a mammal having ovarian cancer or 
a cell isolated therefrom; 
20 (») comparing the level of gene expression of a polynucleotide that 

selectively hybridises to a sequence at least 80% identical to a sequence 
as shown in the Tables in a treated cell or mammal with the level of gene 
expression of the polynucleotide in a control cell or mammal, wherein a test 
compound that modulates the level of expression of the polynucleotide is a 
25 candidate for the treatment of ovarian cancer. 

Typically, the control is a mammal with ovarian cancer or a cell therefrom 
that has not been treated with the test compound. Alternatively, the control is a 
normal cell or mammal. 

The present invention also provides a method for treating a mammal 
30 having ovarian cancer comprising administering a compound identified by the 
assay of the seventh aspect of the invention. 

In an eighth aspect, the present invention provides a pharmaceutical 
composition for use in treating a mammal having ovarian cancer, the composition 
comprising a compound identified by the method of the sixth aspect of the 
35 invention, or the assay of the seventh aspect of the invention, and a 
physiologically acceptable carrier or diluent. 
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In a ninth aspect, the present invention provides an assay device 
comprising a plurality of different polynucleotides of the invention to a solid phase. 
Preferably, the solid phase is a substantially planar chip. 

In a related aspect, the present invention provides an assay device 
5 comprising a plurality of different antibodies of the invention immobilised to a solid 
phase. Preferably, the solid phase is a substantially planar chip. 

The present invention also provides the use of a device of the invention in 
a method of the first or second aspect of the invention or an assay method of the 
seventh aspect of the invention. 
10 In a tenth aspect, the present invention provides a non-human transgenic 

animal which is transgenic by virtue of comprising a polynucleotide of the 
invention as well as transgenic "knock-out" animals that have a disrupted gene 
corresponding to a polynucleotide of the invention. 

Preferred sequences in relation to any or all of the aspects of the invention 
15 referred to above, and throughout the specification, are those that are listed in 
one or more of the Tables as having a P value of less than 0.05, more preferably 
a P value of less than 0.01 . Similarly, preferred sequences are those referred to 
in Table 3B as having a ratio of expression between ovarian patients and normal 
patients of at least 5, more preferably at least 6, 7, 8, 9 or 10, and those 
20 sequences referred to in Table 3B as having a ratio of expression between 
ovarian patients and normal patients of less than -5 or -1 0. 

Throughout this specification, preferred aspects and embodiments apply, 
as appropriate, separately, or in combination, to other aspects and embodiments, 
mutatis mutandis, whether or not explicitly stated as such. 

25 

Detailed description of the invention 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the 
art (e.g. in cell culture, molecular genetics, nucleic acid chemistry, hybridization 

30 techniques and biochemistry). Standard techniques are used for molecular, 
genetic and biochemical methods (see generally, Sambrook et a/., Molecular 
Cloning: A Laboratory Manual, 3 rd ed. (2001) Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. and Ausubel etai, Short Protocols in Molecular 
Biology (1999) 4 th Ed, John Wiley & Sons, Inc. - and the full version entitled 

35 Current Protocols in Molecular Biology, which are incorporated herein by 
reference) and chemical methods. 



Throughout this specification the word "comprise", or variations such as 
"comprises" or "comprising", will be understood to imply the inclusion of a stated 
element, integer or step, or group of elements, integers or steps, but not the 
exclusion of any other element, integer or step, or group of elements, integers or 
5 steps. 

The present invention provides nucleic acid and protein sequences that are 
differentially expressed in ovarian 'cancer, herein termed "ovarian cancer 
sequences." As outlined below, ovarian cancer sequences include those that are 
up-regulated (i.e., expressed at a higher level) in ovarian cancer, as well as those 

10 that are down-regulated (i.e., expressed at a lower level). In a preferred 
embodiment, the ovarian cancer sequences are from humans; however, as will 
be appreciated by those in the art, ovarian cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, 
other ovarian cancer sequences are provided, from vertebrates, including 

15 mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, 
farm animals (including sheep, goats, pigs, cows, horses, etc.) and pets, e.g., 
(dogs, cats, etc.). 

Ovarian cancer sequences from other organisms may be obtained using 
the techniques outlined below. Ovarian cancer sequences can include both 

20 nucleic acid and amino acid sequences. As will be appreciated by those in the art 
and is more fully outlined below, ovarian cancer nucleic acid sequences are 
useful in a variety of applications, including diagnostic applications, which will 
detect naturally occurring nucleic acids, as well as screening applications; e.g., 
biochips comprising nucleic acid probes or PCR microtitre plates with selected 

25 probes to the ovarian cancer sequences can be generated. 

An ovarian cancer sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the ovarian cancer sequences 
outlined herein. Such homology can be based upon the overall nucleic acid, or 
amino acid sequence, and is generally determined as outlined below, using either 

30 homology programs or hybridization conditions. 

For identifying ovarian cancer-associated sequences, the ovarian cancer 
screen typically includes comparing genes identified in different tissues, e.g., 
normal and cancerous tissues, or tumour tissue samples from patients who have 
metastatic disease vs. non metastatic tissue. Other suitable tissue comparisons 

35 include comparing ovarian cancer samples with metastatic cancer samples from 
other cancers, such as lung, breast, gastrointestinal cancers, ovarian, etc. 
Samples of different stages of ovarian cancer, e.g., survivor tissue, drug resistant 
states, and tissue undergoing metastasis, are applied to biochips comprising 
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nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, e.g. from Affymetrix. Gene expression profiles as 
described herein are generated and the data analyzed. 
5 In one embodiment, the genes showing changes in expression as between 

normal and disease states are compared to genes expressed in other normal 
tissues, preferably normal ovarian, but also including, and not limited to lung, 
heart, brain, liver, breast, kidney, muscle, colon, small intestine, large intestine, 
spleen, bone and placenta. In a preferred embodiment, those genes identified 

10 during the ovarian cancer screen that are expressed in any significant amount in 
other tissues are removed from the profile, although in some embodiments, this is 
not necessary. That is, when screening for drugs, it is usually preferable that the 
target be disease specific, to minimise possible side effects. 

In a preferred embodiment, ovarian cancer sequences are those that are 

15 up-regulated in ovarian cancer, that is, the expression of these genes is higher in 
the ovarian cancer tissue as compared to non-cancerous tissue (see Table 1). 
"Up-regulation" as used herein means at least about a two-fold change, 
preferably at least about a three fold change, with at least about five-fold or higher 
being preferred. All Unigene cluster identification numbers and accession 

20 numbers herein are for the GenBank sequence database and the sequences of 
the accession numbers are hereby expressly incorporated by reference. 
Sequences are also available in other databases, e.g., European Molecular 
Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 

In another preferred embodiment, ovarian cancer sequences are those that 

25 are down-regulated in ovarian cancer; that is, the expression of these genes is 
lower in ovarian cancer tissue as compared to non-cancerous tissue (see Table 2 
for example). "Down-regulation" as used herein often means at least about a 
1.5-fold change more preferably a two-fold change, preferably at least about a 
three fold change, with at least about five-fold or higher being most preferred. 

30 Preferred sequences are those referred to in the Tables that are listed in 

one or more of the Tables as having a P value of less than 0.05, more preferably 
a P value of less than 0.01. Similarly, preferred sequences are those referred to 
in Table 3B as having a ratio of expression between ovarian patients and normal 
patients of at least 5, more preferably at least 6, 7, 8, 9 or 10, and those 

35 sequences referred to in Table 3B as having a ratio of expression between 
ovarian patients and normal patients of less than -5 or -10. 



9 



Polynucleotides 

Polynucleotides of the invention comprise the nucleic acid sequences 
whose nucleotide sequences are provided by reference to the public database 
5 accession numbers given in the Tables (referred to herein as the nucleotide 
sequences shown in the Tables), and sequences homologues thereto as well as 
variants, derivatives and fragments thereof. 

The nucleotide sequences referred to in the Tables and homologues 
thereof, typically encode polypeptides. It will be understood by a skilled person 

10 that numerous different polynucleotides can encode the same polypeptide as a 
result of the degeneracy of the genetic code. In addition, it is to be understood that 
skilled persons may, using routine techniques, make nucleotide substitutions that do 
not affect the polypeptide sequence encoded by the polynucleotides of the invention 
to reflect the codon usage of any particular host organism in which the polypeptides 

15 of the invention are to be expressed. 

Polynucleotides of the invention may comprise DNA or RNA. They may be 
single-stranded or double-stranded. They may also be polynucleotides which 
include within them synthetic or modified nucleotides. A number of different types 
of modification to oligonucleotides are known in the art. These include 

20 methylphosphonate and phosphorothioate backbones, addition of acridine or 
polylysine chains at the 3' and/or 5' ends of the molecule. For the purposes of the 
present invention, it is to be understood that the polynucleotides described herein 
may be modified by any method available in the art. Such modifications may be 
carried out in order to enhance the in vivo activity or life span of polynucleotides 

25 of the invention. 

The terms 'Variant" or "derivative" in relation to the nucleotide sequences of 
the present invention include any substitution of, variation of, modification of, 
replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence provided that the resultant nucleotide sequence codes for a polypeptide 

30 having biological activity, preferably having substantially the same activity as the 
polypeptide sequences presented in the sequence listings. 

With respect to sequence homology, preferably there is at least 75%, more 
preferably at least 85%, more preferably at least 90% homology to a sequence 
shown in the Tables herein over a region of at least 20, preferably at least 25 or 30, 

35 for instance at least 40, 60, 100, 500, 1000 or more contiguous nucleotides. More 
preferably there is at least 95%, more preferably at least 98%, homology. In one 
embodiment, homologues are naturally occurring sequences, such as orthologues, 
tissue-specific isoforms and allelic variants. 
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Homology comparisons can be conducted by eye, or more usually, with the 
aid of readily available sequence comparison programs. These commercially 
available computer programs can calculate % homology between two or more 
sequences. 

5 % homology may be calculated over contiguous sequences, i.e. one 

sequence is aligned with the other sequence and each nucleotide in one sequence 
directly compared with the corresponding nucleotide in the other sequence, one 
base at a time. This is called an "ungapped" alignment. Typically, such ungapped 
alignments are performed only over a relatively short number of bases (for example 

10 less than 50 contiguous nucleotides). 

Although this is a very simple and consistent method, it fails to take into 
consideration that, for example, in an otherwise identical pair of sequences, one 
insertion or deletion will cause the following nucleotides to be put out of alignment, 
thus potentially resulting in a large reduction in % homology when a global 

15 alignment is performed. Consequently, most sequence comparison methods are 
designed to produce optimal alignments that take into consideration possible 
insertions and deletions without penalising unduly the overall homology score. This 
is achieved by inserting "gaps" in the sequence alignment to try to maximise local 
homology. 

20 However, these more complex methods assign "gap penalties" to each gap 

that occurs in the alignment so that, for the same number of identical amino acids, a 
sequence alignment with as few gaps as possible - reflecting higher relatedness 
between the two compared sequences - will achieve a higher score than one with 
many gaps. "Affine gap costs'* are typically used that charge a relatively high cost for 

25 the existence of a gap and a smaller penalty for each subsequent residue in the 
gap. This is the most commonly used gap scoring system. High gap penalties will of 
course produce optimised alignments with fewer gaps. Most alignment programs 
allow the gap penalties to be modified. However, it is preferred to use the default 
values when using such software for sequence comparisons. 

30 Calculation of maximum % homology therefore firstly requires the production 

of an optimal alignment, taking into consideration gap penalties. A suitable 
computer program for carrying out such an alignment is the GCG Wisconsin Bestfit 
package (University of Wisconsin, U.S.A; Devereux etal., 1984, Nucleic Adds 
Research 12:387). The default scoring matrix has a match value of 10 for each 

35 identical nucleotide and -9 for each mismatch. The default gap creation penalty is - 
50 and the default gap extension penalty is -3 for each nucleotide. 

Examples of other software than can perform sequence comparisons include, 
but are not limited to, the BLAST package (see Ausubel et a/., 1999 ibid - 
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Chapter 18), FASTA (Atschul et al., 1990, J. Mol. Biol., 403^10) and the 
GENEWORKS suite of comparison tools. Both BLAST and FASTA are available 
for offline and online searching (see Ausubel ef al., 1999 ibid, pages 7-58 to 
7-60). However it is preferred to use the GCG Bestfit program. 

Once the software has produced an optimal alignment, it is possible to 
calculate % homology, preferably % sequence identity. The software typically 
does this as part of the sequence comparison and generates a numerical result. 

A preferred sequence comparison program is the GCG Wisconsin Bestfit 
program described above. 

The present invention also encompasses nucleotide sequences that are 
capable of hybridising selectively to the sequences presented herein, or any variant, 
fragment or derivative thereof, or to the complement of any of the above. Nucleotide 
sequences are preferably at least 15 nucleotides in length, more preferably at least 
20, 30, 40 or 50 nucleotides in length. 

The term "hybridization" as used herein shall include "the process by which 
a strand of nucleic acid joins with a complementary strand through base pairing" 
as well as the process of amplification as carried out in polymerase chain reaction 
technologies. 

Polynucleotides of the invention capable of selectively hybridising to the 
nucleotide sequences presented herein, or to their complement, will be generally at 
least 70%, preferably at least 80 or 90% and more preferably at least 95% or 98% 
homologous to the corresponding nucleotide sequences referred to in the Tables 
over a region of at least 20, preferably at least 25 or 30, for instance at least 40, 60, 
100, 500, 1000 or more contiguous nucleotides. 

The term "selectively hybridizable" means that the polynucleotide used as a 
probe is used under conditions where a target polynucleotide of the invention is 
found to hybridize to the probe at a level significantly above background. The 
background hybridization may occur because of other polynucleotides present, for 
example, in the cDNA or genomic DNA library being screening. In this event, 
background implies a level of signal generated by interaction between the probe 
and a non-specific DNA member of the library which is less than 10 fold, preferably 
less than 100 fold as intense as the specific interaction observed with the target 
DNA The intensity of interaction may be measured, for example, by radiolabelling 
the probe, e.g. with 32 P. 

Hybridization conditions are based on the melting temperature (Tm) of the 
nucleic add binding complex, as taught in Berger and Wmmel (1987, Guide to 
Molecular Cloning Techniques, Methods in Enzymology, Vol 152, Academic 
Press, San Diego CA), and confer a defined "stringency" as explained below. 
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Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm 
of the probe); high stringency at about 5°C to 10°C below Tm; intermediate 
stringency at about 10°C to 20°C below Tm; and low stringency at about 20°C to 
25°C below Tm. As will be understood by those of skill in the art, a maximum 
5 stringency hybridization can be used to identify or detect identical polynucleotide 
sequences while an intermediate (or low) stringency hybridization can be used to 
identify or detect similar or related polynucleotide sequences. 

In a preferred aspect, the present invention covers nucleotide sequences that 
can hybridise to the nucleotide sequence of the present invention under stringent 
10 conditions (e.g. 65°C and O.lxSSC {1xSSC = 0.15 M NaCI, 0.015 M Naa Citrate pH 
7.0}). 

Where the polynucleotide of the invention is double-stranded, both strands of 
the duplex, either individually or in combination, are encompassed by the present 
invention. Where the polynucleotide is single-stranded, it is to be understood that 

15 the complementary sequence of that polynucleotide is also included within the 
scope of the present invention. 

Polynucleotides which are not 100% homologous to the sequences of the 
present invention but fall within the scope of the invention can be obtained in a 
number of ways. Other variants of the sequences described herein may be obtained 

20 for example by probing DNA libraries made from a range of individuals, for example 
individuals from different populations. In particular, given that that changes in the 
expression of nucleotide sequences of the invention correlate with ovarian cancer, 
characterisation of variant sequences in individuals suffering from ovarian cancer 
may be used to identify variations in the sequences of ovarian-cancer associated 

25 polynucleotides (and proteins) that predictive of and/or causative of, ovarian cancer. 

Accordingly the present invention provides methods of identifying sequence 
variations that are associated with ovarian cancer which methods comprise 
determining all or part of the nucleotide sequence of a gene referred to in the 
Tables, derived from an individual suffering from ovarian cancer and comparing the 

30 sequence to that of the corresponding wild-type gene. 

In addition, other viral/bacterial, or cellular homologues particularly cellular 
homologues found in mammalian cells (e.g. rat, mouse, bovine and primate cells), 
may be obtained and such homologues and fragments thereof in general will be 
capable of selectively hybridising to the sequences shown in the sequence listing 

35 herein. Such sequences may be obtained by probing cDNA libraries made from or 
genomic DNA libraries from other animal species, and probing such libraries with 
probes comprising all or part of the sequences referred to in the Tables under 
conditions of medium to high stringency. Similar considerations apply to obtaining 
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species homologues and allelic variants .of the nucleotide sequences referred to in 
the Tables. 

Variants and strain/species homologues may also be obtained using 
degenerate PCR which will use primers designed to target sequences within the 

5 variants and homologues encoding conserved amino acid sequences within the 
sequences of the present invention. Conserved sequences can be predicted, for 
example, by aligning the amino acid sequences from several variants/homologues. 
Sequence alignments can be performed using computer software known in the art. 
For example the GCG Wisconsin PileUp program is widely used. 

10 The primers used in degenerate PCR will contain one or more degenerate 

positions and will be used at stringency conditions lower than those used for cloning 
sequences with single sequence primers against known sequences. 

Alternatively, such polynucleotides may be obtained by site-directed 
mutagenesis of characterised sequences, such as the sequences referred to in the 

15 Tables. This may be useful where for example silent codon changes are required to 
sequences to optimise codon preferences for a particular host cell in which the 
polynucleotide sequences are being expressed. Other sequence changes may be 
desired in order to introduce restriction enzyme recognition sites, or to alter the 
property or function of the polypeptides encoded by the polynucleotides. 

20 Polynucleotides of the invention may be used to produce a primer, e.g. a 

PCR primer, a primer for an alternative amplification reaction, a probe e.g. labelled 
with a revealing label by conventional means using radioactive or non-radioactive 
labels, or the polynucleotides may be cloned into vectors. Such primers, probes and 
other fragments will be at least 15, preferably at least 20, for example at least 25, 30 

25 or 40 nucleotides in length, and are also encompassed by the term polynucleotides 
of the invention as used herein. Preferred fragments are less than 5000, 2000, 
1 000, 500 or 200 nucleotides in length. 

Polynucleotides such as a DNA polynucleotides and probes according to the 
invention may be produced recombinantly, synthetically, or by any means available 

30 to those of skill in the art. They may also be cloned by standard techniques. 

In general, primers will be produced by synthetic means, involving a step 
wise manufacture of the desired nucleic acid sequence one nucleotide at a time. 
Techniques for accomplishing this using automated techniques are readily available 
in the art. 
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Longer polynucleotides will generally be produced using recombinant means, 
for example using PCR (polymerase chain reaction) cloning techniques. This will 
involve making a pair of primers (e.g. of about 15 to 30 nucleotides) flanking a 
region of the sequence which it is desired to clone, bringing the primers into contact 
5 with mRNA or cDNA obtained from an animal or human cell, performing a 
polymerase chain reaction under conditions which bring about amplification of the 
desired region, isolating the amplified fragment (e.g. by purifying the reaction 
mixture on an agarose gel) and recovering the amplified DNA The primers may be 
designed to contain suitable restriction enzyme recognition sites so that the 

10 amplified DNA can be cloned into a suitable cloning vector 

Polynucleotides or primers of the invention may carry a revealing label. 
Suitable labels include radioisotopes such as 32 P or ^S, enzyme labels, or other 
protein labels such as biotin. Such labels may be added to polynucleotides or 
primers of the invention and may be detected using by techniques known in the 

15 art. 

Polynucleotides or primers of the invention or fragments thereof labeled or 
unlabeled may be used by a person skilled in the art in nucleic acid-based tests 
for detecting or sequencing polynucleotides of the invention in the human or 
animal body. 

20 Such tests for detecting generally comprise bringing a biological sample 

containing DNA or RNA into contact with a probe comprising a polynucleotide or 
primer of the invention under hybridising conditions and detecting any duplex 
formed between the probe and nucleic acid in the sample. Such detection may be 
achieved using techniques such as PCR or by immobilising the probe on a solid 

25 support, removing nucleic acid in the sample which is not hybridised to the probe, 
and then detecting nucleic acid which has hybridised to the probe. Alternatively, 
the sample nucleic acid may be immobilised on a solid support, and the amount 
of probe bound to such a support can be detected. Suitable assay methods of this 
and other formats can be found in for example W089/03891 and W090/13667. 

30 Tests for sequencing nucleotides of the invention include bringing a 

biological sample containing target DNA or RNA into contact with a probe 
comprising a polynucleotide or primer of the invention under hybridising 
conditions and determining the sequence by, for example the Sanger dideoxy 
chain termination method (see Sambrook et al.). 

35 Such a method generally comprises elongating, in the presence of suitable 

reagents, the primer by synthesis of a strand complementary to the target DNA or 
RNA and selectively terminating the elongation reaction at one or more of an A, 
C, G or T/U residue; allowing strand elongation and termination reaction to occur; 
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separating out according to size the elongated products to determine the 
sequence of the nucleotides at which selective termination has occurred. Suitable 
reagents include a DNA polymerase enzyme, the deoxynucleotides dATP, dCTP, 
dGTP and dTTP, a buffer and ATP. Dideoxynucleotides are used for selective 
5 termination. 

Tests for detecting or sequencing nucleotides of the invention in a 
biological sample may be used as part of the methods of the invention for 
detecting ovarian cancer-associated transcripts and monitoring the efficacy of 
treatment of patients suffering from ovarian cancer as will be described in more 
10 details below. 

The probes/primers of the invention may conveniently be packaged in the 
form of a test kit in a suitable container. In such kits the probe may be bound to a 
solid support where the assay format for which the kit is designed requires such 
binding. The kit may also contain suitable reagents for treating the sample to be 

15 probed, hybridising the probe to nucleic add in the sample, control reagents, 
instructions, and the like. 

Preferably, a kit of the invention comprises primers/probes suitable for 
selectively detecting a plurality of sequences, more preferably for selectively 
detecting a plurality of sequences that are listed in one or more of the Tables as 

20 having a P value of less than 0.05, more preferably a P value of less than 0.01. 
Similarly, a kit of the invention preferably comprises primers suitable for 
selectively detecting a plurality of sequences referred to in Table 3B as having a 
ratio of expression between ovarian patients and normal patients of at least 5, 
more preferably at least 6, 7, 8, 9 or 10, and those sequences referred to in Table 

25 3B as having a ratio of expression between ovarian patients and normal patients 
of less than -5 or -10. 

DNA Array technologies 

Techniques for producing immobilised arrays of DNA molecules have been 
30 described in the art. Generally, most prior art methods describe how to synthesise 
single-stranded nucleic acid molecule arrays, using for example masking 
techniques to build up various permutations of sequences at the various discrete 
positions on the solid substrate. U.S. Patent No. 5,837,832, the contents of which 
are incorporated herein by reference, describes an improved method for producing 
35 DNA arrays immobilised to silicon substrates based on very large scale integration 
technology. In particular, U.S. Patent No. 5,837,832 describes a strategy called 
"tiling" to synthesize specific sets of probes at spatially-defined locations on a 
substrate which may be used to produced the immobilised DNA arrays. U.S. Patent 
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No. 5,837,832 also provides references for earlier techniques that may also be 
used. 

Thus DNA may be synthesised in situ on the surface of the substrate. 
However, DNA may also be printed directly onto the substrate using for example 
5 robotic devices equipped with either pins or piezoelectric devices. 

The plurality of polynucleotide sequences are typically immobilised onto or 
in discrete regions of a solid substrate. The substrate may be porous to allow 
immobilisation within the substrate or substantially non-porous, in which case the 
library sequences are typically immobilised on the. surface of the substrate. The 

io solid substrate may be made of any material to which polypeptides can bind, 
either directly or indirectly. Examples of suitable solid substrates include flat 
glass, silicon wafers, mica, ceramics and organic polymers such as plastics, 
including polystyrene and polymethacrylate. It may also be possible to use semi- 
permeable membranes such as nitrocellulose or. nylon membranes, which are 

15 widely available. The semi-permeable membranes may be mounted on a more 
robust solid surface such as glass. The surfaces may optionally be coated with a 
layer of metal, such as gold, platinum or other transition metal. A particular 
example of a suitable solid substrate is the commercially available BIACore™ 
chip (Pharmacia Biosensors). 

20 Preferably, the solid substrate is generally a material having a rigid or semi- 

rigid surface. In preferred embodiments, at least one surface of the substrate will 
be substantially flat, although in some embodiments it may be desirable to 
physically separate synthesis regions for different polymers with, for example, 
raised regions or etched trenches. It is also preferred that the solid substrate is 

25 suitable for the high density application of DNA sequences in discrete areas of 
typically from 50 to 100 urn, giving a density of 10000 to 40000 cm -2 . 

The solid substrate is conveniently divided up into sections. This may be 
achieved by techniques such as photoetching, or by the application of 
hydrophobic inks, for example teflon-based inks (Cel-line, USA). 

30 Discrete positions, in which each different member of the array is located 

may have any convenient shape, e.g., circular, rectangular, elliptical, wedge- 
shaped, etc. 

Attachment of the polynucleotide sequences to the substrate may be by 
covalent or non-covalent means. The plurality of polynucleotide sequences may 
35 be attached to the substrate via a layer of molecules to which the sequences 
bind. For example, the sequences may be labelled with biotin and the substrate 
coated with avidin and/or streptavidin. A convenient feature of using biotinylated 
sequences is that the efficiency of coupling to the solid substrate can be 
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determined easily. Since the library sequences may bind only poorly to some 
solid substrates, it is often necessary to provide a chemical interface between the 
solid substrate (such as in the case of glass) and the sequences. Examples of 
suitable chemical interfaces include hexaethylene glycol. Another example is the 
use of polylysine coated glass, the polylysine then being chemically modified 
using standard procedures to introduce an affinity ligand. Other methods for 
attaching molecules to the surfaces of solid substrate by the use of coupling 
agents are known in the art, see for example W098/49557. 

The complete DNA array is typically read at the same time by charged 
coupled device (CCD) camera or confocal imaging system. Alternatively, the 
DNA array may be placed for detection in a suitable apparatus that can move in 
an x-y direction, such as a plate reader. In this way, the change in characteristics 
for.each discrete position can be measured automatically by computer controlled 
movement of the array to place each discrete element in turn in line with the 
15 detection means. 

The detection means are capable of interrogating each position in the 
library array optically or electrically. Examples of suitable detection means 
include CCD cameras or confocal imaging systems. 

20 Nucleotide vectors 

Polynucleotides of the invention can be incorporated into a recombinant 
rephcable vector. The vector may be used to replicate the nucleic acid in a 
compatible host cell. Thus in a further embodiment, the invention provides a 
method of making polynucleotides of the invention by introducing a polynucleotide 
of the invention into a replicable vector, introducing the vector into a compatible 
host cell, and growing the host cell under conditions which bring about replication 
of the vector. The vector may be recovered from the host cell. Suitable host cells 
include bacteria such as E. coli, yeast, mammalian cell lines and other eukaryotic 
cell lines, for example insect Sf9 cells. 

Preferably, a polynucleotide of the invention in a vector is operably linked 
to a control sequence that is capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an expression vector. The term 
"operably linked" means that, the components described are in a relationship 
permitting them to function in their intended manner. A regulatory sequence 
"operably Imked" to a coding sequence is ligated in such a way that expression of 
the coding sequence is achieved under condition compatible with the control 
sequences. 
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The control sequences may be modified, for example by the addition of 
further transcriptional regulatory elements to make the level of transcription 
directed by the control sequences more responsive to transcriptional modulators. 

Vectors of the invention may be transformed or transfected into a suitable 
host cell as described below to provide for expression of a protein of the 
invention. This process may comprise culturing a host cell transformed with an 
expression vector as described above under conditions to provide for expression 
by the vector of a coding sequence encoding the protein, and optionally 
recovering the expressed protein. 

The vectors may be for example, plasmid or virus vectors provided with an 
origin of replication, optionally a promoter for the expression of the said 
polynucleotide and optionally a regulator of the promoter. The vectors may 
contain one or more selectable marker genes, for example an ampicillin 
resistance gene in the case of a bacterial plasmid or a neomycin resistance gene 
for a mammalian vector. Vectors may be used, for example, to transfect or 
transform a host cell. 

Control sequences operably linked to sequences encoding the protein of 
the invention include promoters/enhancers and other expression regulation 
signals. These control sequences may be selected to be compatible with the host 
cell for which the expression vector is designed to be used in. The term 
"promoter" is well-known in the art and encompasses nucleic acid regions ranging 
in size and complexity from minimal promoters to promoters including upstream 
elements and enhancers. 

The promoter is typically selected from promoters which are functional in 
mammalian cells, although prokaryotic promoters and promoters functional in 
other eukaryotic cells may be used. The promoter is typically derived from 
promoter sequences of viral or eukaryotic genes. For example, it may be a 
promoter derived from the genome of a cell in which expression is to occur. With 
respect to eukaryotic promoters, they may be promoters that function in a 
ubiquitous manner (such as promoters of alpha-actin, beta-actin, tubulin) or, 
alternatively, a tissue-specific manner (such as promoters of the genes for 
pyruvate kinase). Tissue-specific promoters specific for particular cells may be 
used if appropriate. They may also be promoters that respond to specific stimuli 
for example promoters that bind steroid hormone receptors. Viral promoters may 
also be used, for example the Moloney murine leukaemia virus long terminal 
repeat (MMLV LTR) promoter, the rous sarcoma virus (RSV) LTR promoter or the 
human cytomegalovirus (CMV) IE promoter 
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It may also be advantageous for the promoters to be inducible so that the 
levels of expression of the heterologous gene can be regulated during the life- 
time of the cell. Inducible means that the levels of expression obtained using the 
promoter can be regulated. 
5 In addition, any of these promoters may be modified by the addition of 

further regulatory sequences, for example enhancer sequences. Chimeric 
promoters may also be used comprising sequence elements from two or more 
different promoters described above. 

10 Host cells 

Vectors and polynucleotides of the invention may be introduced into host 
ceils for the purpose of replicating the vectors/polynucleotides and/or expressing 
the proteins of the invention encoded by the polynucleotides of the invention. 
Although the proteins of the invention may be produced using prokaryotic cells as 
host cells, it is preferred to use eukaryotic cells, for example yeast, insect or 
mammalian cells, in particular mammalian cells. 

Vectors/polynucleotides of the invention may introduced into suitable host 
cells using a variety of techniques known in the art, such as transfection, 
transformation and electroporation. Where vectors/polynucleotides of the 
invention are to be administered to animals, several techniques are known in the 
art, for example infection with recombinant viral vectors such as retroviruses, 
herpes simplex viruses and adenoviruses, direct injection of nucleic acids and 
biolistic transformation. 

25 Polypeptides 

Polypeptides of the invention comprise polypeptides encoded by 
polynucleotides of the invention. It will be understood that polypeptide sequences 
of the invention include sequences homologous to the polypeptides encoded by 
the nucleotide sequences referred to in the Tables, which may be obtained from 
any source, for example related viral/bacterial proteins, cellular homologues and 
synthetic peptides, as well as variants or derivatives thereof. 

Thus, the present invention encompasses variants, homologues or 
derivatives of the amino acid sequences of the present invention, as well as 
variants, homologues or derivatives of the amino acid sequences coded for by the 
nucleotide sequences of the present invention. In one embodiment, homologues 
are naturally occurring sequences, such as orthologues, tissue-specific isoforms 
and allelic variants. 
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In the context of the present invention, a homologous sequence is taken to 
include an amino acid sequence which is at least 60, 70, 80 or 90% identical, 
preferably at least 95 or 98% identical at the amino acid level over at least 20, 40^ 
60 or 80 amino acids with a sequence encoded by a nucleotide sequence 
5 referred to in the Tables. In particular, homology should typically be considered 
with respect to those regions of the sequence known to be essential for specific 
biological functions rather than non-essential neighbouring sequences. 

Although amino acid homology can also be considered in terms of similarity 
(i.e. amino acid residues having similar chemical properties/functions), in the 
10 context of the present invention it is preferred to express homology in terms of 
sequence identity. 

Homology comparisons can be carried out as described above for nucleotide 
sequences with the appropriate modifications for amino acid sequences. For 
example when using the GCG Wisconsin Bestfit package (see below) the default 

15 gap penalty for amino acid sequences is -1 2 for a gap and -4 for each extension. 

It should also be noted that where computer algorithms are used to align 
amino acid sequences, although the final % homology can„be measured in terms 
of identity, the alignment process itself is typically not based on an all-or-nothing 
pair comparison. Instead, a scaled similarity score matrix is generally used that 

20 assigns scores to each pairwise comparison based on chemical similarity or 
evolutionary distance. An example of such a matrix commonly used is the 
BLOSUM62 matrix - the default matrix for the BLAST suite of programs. GCG 
Wisconsin programs generally use either the public default values or a custom 
symbol comparison table if supplied (see user manual for further details). It is 

25 preferred to use the public default values for the GCG package, or in the case of 
other software, the default matrix, such as BLOSUM62. 

The terms "variant" or "derivative" in relation to the amino acid sequences of 
the present invention includes any substitution of, variation of, modification of, 
replacement of, deletion of or addition of one (or more) amino acids from or to the 

30 sequence providing the resultant amino acid sequence preferably has biological 
activity, preferably having at least 25 to 50% of the activity as the polypeptides 
referred to in the sequence listings, more preferably at least substantially the same 
activity. Particular details of biological activity for each polypeptide are given in the 
Tables. 

35 Thus, the polypeptides referred to in the Tables and homologues thereof, 

may be modified for use in the present invention. Typically, modifications are 
made that maintain the activity of the sequence. Thus, in one embodiment, amino 
acid substitutions may be made, for example from 1, 2 or 3 to 10, 20 or 30 
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substitutions provided that the modified sequence retains at least about 25 to 
50% of, or substantially the same activity. However, in an alternative 
embodiment, modifications to the amino acid sequences of a polypeptide of the 
invention may be made intentionally to reduce the biological activity of the 
polypeptide. For example truncated polypeptides that remain capable of binding 
to target molecules but lack functional effector domains may be useful as 
inhibitors of the biological activity of the full length molecule. 

In general, preferably less than 20%, 10% or 5% of the amino add residues 
of a variant or derivative are altered as compared with the corresponding region of 
the polypeptides referred to in the Tables. 

Amino acid substitutions may include the use of non-naturally occurring 
analogues, for example to increase blood plasma half-life of a therapeutically 
administered polypeptide (see below for further details on the production of 
peptide derivatives for use in therapy). 

Conservative substitutions may be made, for example according to the 
Table below. Amino acids in the same block in the second column and preferably 
in the same line in the third column may be substituted for each other: 



ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HFWY 



Polypeptides of the invention also include fragments of the above mentioned 
full length polypeptides and variants thereof, including fragments of the sequences 
referred to in the Tables and homologues thereof. Preferred fragments include 
those which include an epitope. Suitable fragments will be at least about 6 or 8, e.g. 
at least 10, 12, 15 or 20 amino acids in length. They may also be less than 200, 100 
or 50 amino acids in length. Polypeptide fragments may contain one or more (e.g. 2, 
3, 5, or 10) substitutions, deletions or insertions, including conserved substitutions. 
Where substitutions, deletion and/or insertions have been made, for example by 
means of recombinant technology, preferably less than 20%, 10% or 5% of the 
amino acid residues depicted in the sequence listings are altered. 
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Proteins of the invention are typically made by recombinant means, for 
example as described below. However they may also be made by synthetic 
means using techniques well known to skilled persons such as solid phase 
synthesis. Various techniques for chemically synthesising peptides are reviewed 

5 by Borgia and Fields, 2000, TibTech 18: 243-251 and described in detail in the 
references contained therein. 

Proteins of the invention may also be produced as fusion proteins, for 
example to aid in extraction and purification. Examples of fusion protein partners 
include glutathione-S-transferase (GST), hexahistidine, GAL4 (DNA binding 

10 and/or transcriptional activation domains) and |J-galactosidase. It may also be 
convenient to include a proteolytic cleavage site between the fusion protein 
partner and the protein sequence of interest to allow removal of fusion protein 
sequences. Preferably the fusion protein will not hinder the function of the protein 
of interest sequence. 

15 Proteins of the invention may be in a substantially isolated form; It will be 

understood that the protein may be mixed with carriers or diluents which will not 
interfere with the intended purpose of the protein and still be regarded as 
substantially isolated. A protein of the invention may also be in a substantially 
purified form, in which case it will generally comprise the protein in a preparation 

20 in which more than 90%, e.g, 95%, 98% or 99% of the protein in the preparation 
is a protein of the invention. 

Characteristics of ovarian cancer-associated proteins 

Ovarian cancer proteins of the present invention may be classified as 
25 secreted proteins, transmembrane proteins or intracellular proteins. In one 
embodiment, the ovarian cancer protein is an intracellular protein. Intracellular 
proteins may be found in the cytoplasm and/or in the nucleus. Intracellular 
proteins are involved in all aspects of cellular function and replication (including, 
e.g., signaling pathways); aberrant expression of such proteins often results in 
30 unregulated or disregulated cellular processes (see, e.g., Molecular Biology of the 
Cell (Alberts, ed., 3rd ed., 1994). For example, many intracellular proteins have 
enzymatic activity such as protein kinase activity, protein phosphatase activity, 
protease activity, nucleotide cyclase activity, polymerase activity and the like. 
Intracellular proteins also serve as docking proteins that are involved in 
35 organizing complexes of proteins, or targeting proteins to. various subcellular 
localizations, and are involved in maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterising proteins is the 
presence in the proteins of one or more motifs for which defined functions have 
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been attributed. In addition to the highly conserved sequences found in the 
enzymatic domain of proteins, highly conserved sequences have been identified 
in proteins that are involved in protein-protein interaction. For example, 
Src-homology-2 (SH2) domains bind tyrosine-phosphorylated targets in a 

5 sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to 
proline-rich targets. In addition, PH domains, tetratricopeptide repeats and WD 
domains to name only a few, have been shown to mediate protein-protein 
interactions. Some of these may also be involved in binding to phospholipids or 

10 other second messengers. As will be appreciated by one of ordinary skill in the 
art, these motifs can be identified on the basis of primary sequence; thus, an 
analysis of the sequence of proteins may provide insight into both the enzymatic 
potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of 

15 multiple sequence alignments and hidden Markov models covering many 
common protein domains. Versions are available via the internet from 
Washington University in St. Louis, the Sanger Center in England, and the 
Karolinska Institute in Sweden (see, e.g., Bateman et al., 2000, Nuc. Acids Res. 
28: 263-266; Sonnhammer et al., 1997, Proteins 28: 405-420; Bateman et al., 

20 1999, Nuc. Acids Res. 27:260-262; and Sonnhammer et al., 1998, Nuc. Acids 
Res. 26: 320-322. 

In another embodiment, the ovarian cancer sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer 
of a cell. They may have an intracellular domain, an extracellular domain, or both. 

25 The intracellular domains of such proteins may have a number of functions 
including those already described for intracellular proteins. For example, the 
intracellular domain may have enzymatic activity and/or may serve as a binding 
site for additional proteins. Frequently the intracellular domain of transmembrane 
proteins serves both roles. For example certain receptor tyrosine kinases have 

30 both protein kinase activity and SH2 domains. In addition, autophosphorylation of 
tyrosines on the receptor molecule itself, creates binding sites for additional SH2 
domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, 

35 receptor guanylyl cyclases and receptor serine/threonine protein kinases contain 
■a single transmembrane domain. However, various other proteins including 
channels and adenylyl cyclases contain numerous transmembrane domains. 
Many important cell surface receptors such as G protein coupled receptors 
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(GPCRs) are classified as "seven transmembrane domain" proteins, as they 
contain 7 membrane spanning regions. Characteristics of transmembrane 
domains include approximately 20 consecutive hydrophobic amino acids that may 
be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane 
domains within the protein may be predicted (see, e.g. PSORT web site 
http://psort.nibb.ac.jp/). Important transmembrane protein receptors include but 
are not limited to the insulin receptor, insulin-like growth factor receptor, human 
growth hormone receptor, glucose transporters, transferrin receptor, epidermal 
growth factor receptor, low density lipoprotein receptor, epidermal growth factor 
receptor, leptin receptor, interleukin receptors, e.g. IL-1 receptor, IL-2 receptor, 

The extracellular domains of transmembrane proteins are diverse- 
however, conserved motifs are found repeatedly among various extracellular 
domains. Conserved structure and/or functions have been ascribed to different 
extracellular motifs. Many extracellular domains are involved in binding to other 
molecules. In one aspect, extracellular domains are found on receptors Factors 
that bind the receptor domain include circulating ligands, which may be peptides 
proteins, or small molecules such as adenosine and the like. For example growth 
factors such as EGF, FGF and PDGF are circulating growth factors that bind to 
20 their cognate receptors to initiate a variety of cellular responses. Other factors 
include cytokines, mitogenic factors, neurotrophic factors and the like 
Extracellular domains also bind to cell-associated molecules. In this respect they 
mediate cell-cell interactions., Cell-associated ligands can be tethered to the cell 
e.g., via a glycosylphosphatidylinositol (GPI) anchor, or may themselves be 
25 transmembrane proteins. Extracellular domains also associate with the 
extracellular matrix and contribute to the maintenance of the cell structure 

Ovarian cancer proteins that are transmembrane are particularly preferred 
in the present • invention as they are readily accessible targets for 
immunotherapeutics, as are described herein. In addition, as outlined below 
30 transmembrane proteins can be also useful in imaging modalities. Antibodies may 
be used to label such readily accessible proteins in situ. Alternatively, antibodies 
can also label intracellular proteins, in which case samples are typically 
permeabhzed to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein 
can be made soluble by removing transmembrane sequences, e.g., through 
recombinant methods. Furthermore, transmembrane proteins that have been 
made soluble can be made to be secreted through recombinant means by adding 
an appropriate signal sequence. 
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In another embodiment, the ovarian cancer proteins are secreted proteins; 
the secretion of which can be either constitutive or regulated. These proteins 
have a signal peptide or signal sequence that targets the molecule to the 
secretory pathway. Secreted proteins are involved in numerous physiological 
5 events; by virtue of their circulating nature, they serve to transmit signals to 
various other cell types. The secreted protein may function in an autocrine 
manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine 
manner (acting on cells at a distance). Thus secreted molecules find use in 
10 modulating or altering numerous aspects of physiology. Ovarian cancer proteins 
that are secreted proteins are particularly preferred in the present invention as 
they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, 
or stool tests. 
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15 Therapeutic peptides 

Polypeptides/peptides of the present invention may be administered 
therapeutically to patients. It is preferred to use peptides that do not consisting 
solely of naturally-occurring amino acids but which have been modified, for 
example to reduce immunogenicity, to increase circulatory half-life in the body of 
the patient, to enhance bioavailability and/or to enhance efficacy and/or 
specificity. 

A number of approaches have been used to modify peptides for 
therapeutic application. One approach is to link the peptides or proteins to a 
variety of polymers, such as polyethylene glycol (PEG) and polypropylene glycol 
(PPG) - see for example U.S. Patent Nos. 5,091,176, 5,214,131 and US 
5,264,209. 

Replacement of naturally-occurring amino acids with a variety of uncoded 
or modified amino acids such as D-amino acids and N-methyl amino acids may 
also be used to modify peptides 
30 Another approach is to use bifunctional crosslinkers, such as N- 

succinimidyl 3-(2 pyridyldithio) propionate, succinimidyl 6-[3-(2 pyridyldithio) 
propionamido] hexanoate, and sulfosuccinimidyl 6-[3-(2 pyridyldithio) 
propionamido]hexanoate (see US Patent 5,580,853). 

It may be desirable to use derivatives of the peptides of the invention which 
are conformationally constrained. Conformational constraint refers to the stability 
and preferred conformation of the three-dimensional shape assumed by a 
peptide. Conformational constraints include local constraints, involving restricting 
the conformational mobility of a single residue in a peptide; regional constraints, 
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involving restricting the conformational mobility of a group of residues, which 
residues may form some secondary structural unit; and global constraints, 
involving the entire peptide structure. 

The active conformation of the peptide may be stabilized by a covalent 
5 modification, such as cyclization or by incorporation of gamma-lactam or other 
types of bridges. For example, side chains can be cyclized to the backbone so as 
create a L-gamma-lactam moiety on each side of the interaction site. See, 
generally, Hruby et al., "Applications of Synthetic Peptides," in Synthetic 
Peptides: A User's Guide: 259-345 (W. H. Freeman & Co. 1992). Cyclization also 

10 can be achieved, for example, by formation of cystine bridges, coupling of amino 
and carboxy terminal groups of respective terminal amino acids, or coupling of 
the amino group of a Lys residue or a related homolog with a carboxy group of 
Asp, Glu or a related homolog. Coupling of the .alpha-amino group of a 
polypeptide with the epsilon-amino group of a lysine residue, using iodoacetic 

15 anhydride, can be also undertaken. See Wood and Wetzel, 1992, Infl J. Peptide 
Protein Res. 39: 533-39. 

Another approach described in US 5,891,418 is to include a metal-ion 
complexing backbone in the peptide structure. Typically, the preferred metal- 
peptide backbone is based on the requisite number of particular coordinating 

20 groups required by the coordination sphere of a given complexing metal ion. In 
general, most of the metal ions that may prove useful have a coordination number 
of four to six. The nature of the coordinating groups in the peptide chain includes 
nitrogen atoms with amine, amide, imidazole, or guanidino functionalities; sulfur 
atoms of thiols or disulfides; and oxygen atoms of hydroxy, phenolic, carbonyl, or 

25 carboxyl functionalities. In addition, the peptide chain or individual amino acids 
can be chemically altered to include a coordinating group, such as for example 
oxime, hydrazino, sulfhydryl, phosphate, cyano, pyridine piperidino, or 
morpholino. The peptide construct can be either linear or cyclic, however a linear 
construct is typically preferred. One example of a small linear peptide is Gly-Gly- 

30 Gly-Gly which has four nitrogens (an N 4 complexation system) in the back bone 
that can complex to a metal ion with a coordination number of four. 

A further technique for improving the properties of therapeutic peptides is 
to use non-peptide peptidomimetics. A wide variety of useful techniques may be 
used to elucidating the precise structure of a peptide. These techniques include 

35 amino acid sequencing, x-ray crystallography, mass spectroscopy, nuclear 
magnetic resonance spectroscopy, computer-assisted molecular modeling, 
peptide mapping, and combinations thereof. Structural analysis of a peptide 
generally provides a large body of data which comprise the amino acid sequence 
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of the peptide as well as the three-dimensional positioning of its atomic 
components. From this information, non-peptide peptidomimetics may be 
designed that have the required chemical functionalities for therapeutic activity 
but are more stable, for example less susceptible to biological degradation. An 
example of this approach is provided in US 5,81 1 ,51 2. 

Techniques for chemically synthesising therapeutic peptides of the 
invention are described in the above references and also reviewed by Borgia and 
Fields, 2000, TibTech 18: 243-251 and described in detail in the references 
contained therein. 



Protein Expression and Purification 

Host cells comprising polynucleotides of the invention may be used to 
express proteins of the invention. Host cells may be cultured under suitable 
conditions which allow expression of the proteins of the invention. Expression of 
the proteins of the invention may be constitutive such that they are continually 
produced, or inducible, requiring a stimulus to initiate expression. In the case of 
inducible expression, protein production can be initiated when required by, for 
example, addition of an inducer substance to the culture medium, for example 
dexamethasone or IPTG. 

Proteins of the invention can be extracted from host cells by a variety of 
techniques known in the art, including enzymatic, chemical and/or osmotic lysis 
and physical disruption. 

Polypeptides of the invention may also be produced recombinantly in an in 
vitro cell-free system, such as the TnT™ (Promega) rabbit reticulocyte system or 
25 a wheat germ lysate system. 

Antibodies 

The invention also provides monoclonal or polyclonal antibodies that bind 
specifically to polypeptides of the invention or fragments thereof. Thus, the present 

30 invention further provides a process for the production of monoclonal or polyclonal 
antibodies to polypeptides of the invention. 

The phrase "binds specifically" to a polypeptide means that the binding of 
the antibody to the protein or peptide is determinative of the presence of the 
protein, in a heterogeneous population of proteins and other biologies. Thus, 

35 under designated immunoassay conditions, the specified antibodies bind to a 
particular protein at least two times the background and more typically more than 
10 to 100 times background. Typically, antibodies of the invention bind to a protein 
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of interest with a Kd of at least about 0.1 mM, more usually at least about 1 jiM, 
preferably at least about 0.1 ^M, and most preferably at least, 0.01 jaM. 

If polyclonal antibodies are desired, a selected mammal (e.g., mouse, 
rabbit, goat, horse, etc.) is immunised with an immunogenic polypeptide bearing a 
5 epitope(s). Serum from the immunised animal is collected and treated according 
to known procedures. If serum containing polyclonal antibodies to a particular 
polypeptide epitope of interest contains antibodies to other antigens, the 
polyclonal antibodies can be purified by immunoaffinity chromatography. 
Techniques for producing and processing polyclonal antisera are known in the 

10 art. In order that such antibodies may be made, the invention also provides 
polypeptides of the invention or fragments thereof haptenised to another 
polypeptide for use as immunogens in animals or humans. 

Monoclonal antibodies directed against particular epitopes in the 
polypeptides of the invention can also be readily produced by one skilled in the 

15 art. The general methodology for making monoclonal antibodies by hybridomas is 
well known. Immortal antibody-producing cell lines can be created by cell fusion, 
and also by other techniques such as direct transformation of B lymphocytes with 
oncogenic DNA, or transfection with Epstein-Barr virus. Panels of monoclonal 
antibodies produced against particular epitopes can be screened for various 

20 properties; i.e., for isotype and epitope affinity. 

An alternative technique involves screening phage display libraries where, 
for example the phage express scFv fragments on the surface of their coat with a 
large variety of complementarity determining regions (CDRs). This technique is 
well known in the art. 

25 Antibodies, both monoclonal and polyclonal, which are directed against 

polypeptides of the invention or fragments thereof are particularly useful in 
diagnosis, and those which are neutralising are useful in passive immunotherapy. 
Monoclonal antibodies, in particular, may be used to raise anti-idiotype 
antibodies. Anti-idiotype antibodies are immunoglobulins which carry an "internal 
30 image" of the antigen of the agent against which protection is desired. 

Techniques for raising anti-idiotype antibodies are known in the art. These 
anti-idiotype antibodies may also be useful in therapy. 

For the purposes of this invention, the term "antibody", unless specified to the 
contrary, includes fragments of whole antibodies which retain their binding activity 
35 for a target antigen. Such fragments include Fv, F(ab') and F(ab') 2 fragments, as 
. well as single chain antibodies (scFv). Furthermore, the antibodies and fragments 
thereof may be humanised antibodies, for example as described in EP-A-239400. 
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Antibodies may be used in method of detecting polypeptides of the invention 
present in biological samples by a method which comprises: 

(a) providing an antibody of the invention; 

(b) incubating a biological sample with said antibody under conditions 
5 which allow for the formation of an antibody-antigen complex; and 

(c) determining whether an antibody-antigen complex comprising said 
antibody is formed. 

Such tests for detecting polypeptides of the invention in a biological sample 
10 may be used as part of the methods of the invention for detecting ovarian cancer- 
associated polypeptides and monitoring the efficacy of treatment of patients 
suffering from ovarian cancer as will be described in more detail below. 

Antibodies of the invention may be bound to a solid support and/or packaged 
into kits in a suitable container along with suitable reagents, controls, instructions 
15 and the like. 

Construction of Transgenic Animals Expressing Polypeptides of the 
Invention and "Knock-Out" Animals 

The present invention also provides transgenic animals which are 

20 transgenic by virtue of comprising a polynucleotide of the invention, i.e. animals 
transformed with a polynucleotide of the invention. Suitable animals are generally 
from the phylum chordata. Chordates includes vertebrate groups such as 
mammals, birds, reptiles and amphibians. Particular examples of mammals 
include non-human primates, cats, dogs, ungulates such as cows, goats, pigs, 

25 sheep and horses and rodents such as mice, rats, gerbils and hamsters. 
Transgenic animals within the meaning of the present invention are non-human 
animals and the production of transgenic humans is specifically excluded. 

Techniques for producing transgenic animals are well known in the art. A 
useful general textbook on this subject is Houdebine, Transgenic animals - 

30 Generation and Use (Harwood Academic, 1997) - an extensive review of the 
techniques used to generate transgenic animals from fish to mice and cows. 

Advances in technologies for embryo micromanipulation now permit 
introduction of heterologous DNA into, for example, fertilized mammalian ova. For 
instance, totipotent or pluripotent stem cells can be transformed by microinjection, 

35 calcium phosphate mediated precipitation, liposome fusion, retroviral infection or 
other means, the transformed cells are then introduced, into the embryo, and the 
embryo then develops into a transgenic animal. In a highly preferred method, 
developing embryos are infected with a retrovirus containing the desired DNA 
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and transgenic animals produced from the infected embryo. In a most preferred 
method, however, the appropriate DNAs are coinjected into the pronucleus or 
cytoplasm of embryos, preferably at the single cell stage, and the embryos 
allowed to develop into mature transgenic animals. Those techniques as well 

5 known. See reviews of standard laboratory procedures for microinjection of 
heterologous DNAs into mammalian fertilized ova, including Hogan ef a/., 
Manipulating the Mouse Embryo, (Cold Spring Harbor Press 1986); Krimpenfort 
ef a/., Bio/Technology 9:844 (1991); Palmiter ef a/., Cell; 41: 343 (1985); Kraemer 
ef a/., Genetic manipulation of the Mammalian Embryo, (Cold Spring Harbor 

io Laboratory Press 1985); Hammer ef a/., Nature, 315: 680 (1985); Wagner ef a/., 
U.S. Pat No. 5,175,385; Krimpenfort et a/., U.S. Pat. No. 5,175,384, the 
respective contents of which are incorporated herein by reference 

Another method used to produce a transgenic animal involves 
microinjecting a nucleic acid into pro-nuclear stage eggs by standard methods. 

15 Injected eggs are then cultured before transfer into the oviducts of 
pseudopregnant recipients. 

Transgenic animals may also be produced by nuclear transfer technology 
as described in Schnieke, A.E. ef a/., 1997, Science, 278: 2130 and Cibelli, J.B. 
ef a/., 1998, Science, 280: 1256. Using this method, fibroblasts from donor 

20 animals are stably transfected with a plasmid incorporating the coding sequences 
for a binding domain or binding partner of interest under the control of regulatory. 
Stable transfectants are then fused to enucleated oocytes, cultured and 
transferred into female recipients. 

Analysis of animals which may contain transgenic sequences would 

25 typically be performed by either PCR or Southern blot analysis following standard 
methods. 

By way of a specific example for the construction of transgenic mammals, 
such as cows, nucleotide constructs comprising a sequence encoding a binding 
domain fused to GFP are microinjected using, for example, the technique 
30 described in U.S. Pat. No. 4,873,191, into oocytes which are obtained from 
ovaries freshly removed from the mammal. The oocytes are aspirated from the 
follicles and allowed to settle before fertilization with thawed frozen sperm 
capacitated with heparin and prefractionated by Percoll gradient to isolate the 
motile fraction. 

35 The fertilized oocytes are centrifuged, for example, for eight minutes at 

15,000 g to visualize the pronuclei for injection and then cultured from the zygote 
to morula or blastocyst stage in oviduct tissue-conditioned medium. This medium 
is prepared by using luminal tissues scraped from oviducts and diluted in culture 
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medium. The zygotes must be placed in the culture medium within two hours 
following microinjection. 

Oestrous is then synchronized in the intended recipient mammals, such as 
cattle, by administering coprostanol. Oestrous is produced within two days and 

5 the embryos are transferred to the recipients 5-7 days after estrous. Successful 
transfer can be evaluated in the offspring by Southern blot. 

Alternatively, the desired constructs can be introduced into embryonic 
stem cells (ES cells) and the cells cultured to ensure modification by the 
transgene. The modified cells are then injected into the blastula embryonic stage 

10 and the blastulas replaced into pseudopregnant hosts. The resulting offspring are 
chimeric with respect to the ES and host cells, and nonchimeric strains which 
exclusively comprise the ES progeny can be obtained using conventional cross- 
breeding. This technique is described, for example, in WO91/10741. 

In another embodiment, transgenic animals of the present invention are 

15 transgenic "knock-out" animals where a specific gene corresponding to a 
polynucleotide referred to in the Tables has been rendered non-functional by 
homologous recombination. The generation of "knock-out" animals is similar to 
the production of other transgenic animals except that the polynucleotide 
constructs are designed to integrate into the endogenous genes and disrupt the 

20 function of the endogenous sequences. The generation of "knock-ouf animals is 
known in the art, including the design of suitable constructs that will recombine at 
the appropriate site in the genome. 

In one embodiment, the heterologous sequence which it is desired to 
recombine into the genome of a target animal comprises a functional sequence 

25 but under the control of an inducible promoter so that expression of the gene can 
be regulated by administration of an endogenous molecule. This may be 
advantageous where disruption of the gene is embryonic-lethal. 

"Knock-out* animals can be used as animal models for the study of gene 
function. 

30 

Informatics 

The ability to identify genes that are over or under expressed in ovarian 
cancer can additionally provide high-resolution, high-sensitivity datasets which 
can be used in the areas of diagnostics, therapeutics, drug development, 
35 pharmacogenetics, protein structure, biosensor development, and other related 
areas. For example, the expression profiles can be used in diagnostic or 
prognostic evaluation of patients with ovarian cancer. Or as another example, 
subcellular toxicological information can be generated to better direct drug 
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structure and activity correlation (see Anderson, Pharmaceutical Proteomics: 
Targets, Mechanism, and Function, paper presented at the IBC Proteomics 
conference, Coronado, CA (June 11-12, 1998)). Subcellular toxicological 
information can also be utilized in a biological sensor device to predict the likely 
5 toxicological effect of chemical exposures and likely tolerable exposure 
thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue from 
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
saccharides,' lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database 
10 that includes at least one set of assay data. The data contained in the database is 
acquired, e.g., using array analysis either singly or in a library format. The 
database can be in substantially any form in which data can be maintained and 
transmitted, but is preferably an electronic database. The electronic database of 
the invention can be maintained on any electronic device allowing for the storage 
15 of and access to the database, such as a personal computer, but is preferably 
distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide 
sequence data is for clarity of illustration only. It will be apparent to those of skill 
in the art that similar databases can be assembled for any assay data acquired 
20 using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the 
relative and/or. absolute abundance of a variety of molecular and macromolecular 
species from a biological sample undergoing ovarian cancer, i.e., the 
identification of ovarian cancer-associated sequences described herein, provide 
25 an abundance of information, which can be correlated with pathological 
conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and 
physiological status, among others. Although the data generated from the assays 
of the invention is suited for manual review and analysis, in a preferred 
30 embodiment, prior data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a 
relational database system for storing biomolecular sequence information in a 
manner that allows sequences to be catalogued and searched according to one 
or more protein function hierarchies. U.S. Patent 5,953,727 discloses a relational 
database having sequence records containing information in a format that allows 
a collection of partial-length DNA sequences to be catalogued and searched 
according to association with one or more sequencing projects for obtaining 
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full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a 
gene sequence similar to a sequence data item in a gene database based on the 
degree of similarity between a key sequence and a target sequence. U.S. Patent 
5,538,897 discloses a method using mass spectroscopy fragmentation patterns of 
peptides to identify amino acid sequences in computer databases by comparison 
of predicted mass spectra with experimentally-derived mass spectra using a 
closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi-dimensional 
database comprising a functionality for multi-dimensional data analysis described 
as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or 
dimension. U.S. Patent 5,295,261 reports a hybrid database structure in which 
the fields of each database record are divided into two classes, navigational and 
informational data, with navigational fields stored in a hierarchical topological map 
which can be viewed as a tree structure or as the merger of two or more such 
tree structures. 

See also Mount ef a/., Bioinformatics (2001); Biological Sequence Analysis- 
Probabilistic Models of Proteins and Nucleic Acids (Durbin ef al., eds 1999)- 
Bioiraformatics: A Practical Guide to the Analysis of Genes and Proteins 
(Baxevanis & Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic 
Applications in Biological Science and Medicine (1999); Introduction to 
Computational Molecular Biology (Setubal et al., eds 1997); Bioinformatics- 
Methods and Protocols (Misener & Krawetz, eds, 2000); Bioinformatics- 
Sequence, Structure, and Databanks: A Practical Approach (Higgins & Taylor 
eds., 2000); Brown, Bioinfor7natics: A Biologist's Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000)- 
and Waterman, Introduction to Computational Biology: Maps, Sequences and 
Genomes (1995). 

The present invention provides a computer database comprising a 
computer and software for storing in computer-retrievable form assay data 
records cross-tabulated, e.g., with data specifying the source of the 
target-containing sample from which each sequence specificity record was 
obtained. 

In an exemplary embodiment, at least one of the sources of 
target-containing sample is from a control tissue sample known to be free of 
pathological disorders. In a variation, at least one of the sources is a known 
pathological tissue specimen, e.g., a neoplastic lesion or another tissue specimen 
to be analyzed for prostate cancer. In another variation, the assay records 
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cross-tabulate one or more of the following parameters for each target species in 
a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., 
electrophoretic coordinates); (2) sample source; and (3) absolute and/or relative 

5 quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic 
disks, optical disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, 
RDRAM, DDR RAM, magnetic bubble memory devices, and other data storage 

10 devices, including CPU registers and on-CPU data storage arrays. Typically, the 
target data records are stored as a bit pattern in an array of magnetic domains on 
a magnetizable medium or as an array of charge states or transistor gate states, 
such as an array of cells in a DRAM device (e.g., each cell comprised of a 
transistor and a charge storage area, which may be on the transistor). In one 

is embodiment, the invention provides such storage devices, and computer systems 
built therewith, comprising a bit pattern encoding a protein expression fingerprint 
record comprising unique identifiers for at least 10 target data records 
cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably 

20 provides a method for identifying related peptide or nucleic acid sequences, 
comprising performing a computerised comparison between a peptide or nucleic 
acid sequence assay record stored in or retrieved from a computer storage 
device or database and at least one other sequence. The comparison can include 
a sequence analysis or comparison algorithm or computer program embodiment 

25 thereof (e.g., BLAST, FASTA, TFASTA, GAP, BESTFIT - see above) and/or the 
comparison may be of the relative amount of a peptide or nucleic acid sequence 
in a pool of sequences determined from a polypeptide or nucleic acid sample of a 
specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
30 compatible (DOS, Windows, Windows95/.98/2000, Windows NT, OS/2) or other 
format (e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) 
floppy diskette or hard (fixed, Winchester) disk drive, comprising a bit pattern 
encoding data from an assay of the invention in a file format suitable for retrieval 
and processing in a computerized sequence analysis, comparison, or relative 
35 quantitation method. 

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), 
telephone line, ISDN line, wireless network, optical fiber, or other suitable signal 
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transmission medium, whereby at least one network device (e.g., computer, disk 
array, etc.) comprises a pattern of magnetic domains (e.g., magnetic disk) and/or 
charge domains (e.g., an array of DRAM cells) composing a bit pattern encoding 
data acquired from an assay of the invention. 

5 The invention also provides a method for transmitting assay data that 

includes generating an electronic signal on an electronic communications device, 
such as a modem, ISDN terminal adapter, DSL, cable modem, ATM switch, or 
the like, wherein the signal includes (in native or encrypted format) a bit pattern 
encoding data from an assay or a database comprising a plurality of assay results 

10 obtained by the method of the invention. 

. In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, 
such as an assay result obtained by the method of the invention, and ranking 
database targets based on the degree of identity and gap weight to the target 

15 data. A central processor is preferably initialized to load and execute the 
computer program for alignment and/or comparison of the assay results. Data for 
a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

20 The target data or record and the computer program can be transferred to 

secondary memory, which is typically random access memory (e.g., DRAM, 
SRAM, SGRAM, or SDRAM). Targets are ranked according to the degree of 
correspondence between a selected assay characteristic (e.g., binding to a 
selected affinity moiety) and the same characteristic of the query target and 

25 results are output via an I/O device. For example, a central processor can be a 
conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public 
domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a 

30 memory device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble 
memory, flash memory, etc.); an I/O device can be a terminal comprising a video 
display and a keyboard, a modem, an ISDN terminal adapter, an Ethernet port, a 
punched card reader, a magnetic strip reader, or other suitable I/O device. 

The invention also preferably provides the use of a computer system, such 

35 as that described above, which comprises: (1) a computer; (2) a stored bit pattern 
encoding a collection of peptide sequence specificity records obtained by the 
methods of the invention, which may be stored in the computer; (3) a comparison 
target, such as a query target; and (4) a program for alignment and comparison, 
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typically with rank-ordering of comparison results on the basis of computed 
similarity values. 

Detection of ovarian cancer sequence for diagnostic and therapeutic 
5 applications 

In one aspect, the RNA expression levels of genes are determined for 
different cellular states in the ovarian cancer phenotype. Expression levels of 
genes in normal tissue (i.e., not undergoing ovarian cancer) and in ovarian cancer 
tissue (and in some cases, for varying severities of ovarian cancer that relate to 

10 prognosis, as outlined below) are evaluated to provide expression profiles. An 
expression profile of a particular cell state or point of development is essentially a 
"fingerprint" of the state. While two states may have any particular gene similarly 
expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By 

15 comparing expression profiles of cells in different states, information regarding 
which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed 
to determine whether a tissue sample has the gene expression profile of normal 
or cancerous tissue. This will provide for molecular diagnosis of related 

20 conditions. 

"Differential expression," or grammatical equivalents as used herein, refers 
to qualitative or quantitative differences in the temporal and/or cellular gene 
expression patterns within and among cells and tissue. Thus, a differentially 
expressed gene can qualitatively have its expression altered, including an 

25 activation or inactivation, in, e.g., normal versus ovarian cancer tissue. Genes 
may be turned on or turned off in a particular state, relative to another state thus 
permitting comparison of two or more states. A qualitatively regulated gene will 
exhibit an expression pattern within a state or cell type which is detectable by 
standard techniques. Some genes will be expressed in one state or cell type, but 

30 not in both. Alternatively, the difference in expression may be quantitative, e.g., in 
that expression is increased or decreased; i.e., gene expression is either 
upregulated, resulting in an increased amount of transcript, or downregulated, 
resulting in a decreased amount of transcript. The degree to which expression 
differs need only be large enough to quantify via standard characterization 

35 techniques as outlined below, such as by use of Affymetrix GeneChipTM 
expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), hereby 
expressly incorporated by reference. Other techniques include, but are not limited 
to, quantitative reverse transcriptase PCR, northern analysis and RNase 
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protection. As outlined above, preferably the change in expression (i.e., 
upregulation or downregulation) is at least about 50%, more preferably at least 
about 100%, more preferably at least about 150%, more preferably at least about 
200%, with from 300 to at least 1000% being especially preferred. 
5 Evaluation may be at the gene transcript, or the protein level. The amount 

of gene expression may be monitored using nucleic acid probes to the DNA or 
RNA equivalent of the gene transcript, and the quantification of gene expression 
levels, or, alternatively, the final gene product itself (protein) can be monitored, 
e.g., with antibodies to the ovarian cancer protein and standard immunoassays 

10 (ELISAs, etc.) or other techniques, including mass spectroscopy assays, 2D gel 
electrophoresis assays, etc. Proteins corresponding to ovarian cancer genes, i.e., 
those identified as being important in a ovarian cancer phenotype, can be 
evaluated in a ovarian cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed on a 

15 plurality of genes. Multiple protein expression monitoring can be performed as 
well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the ovarian cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of ovarian cancer 
sequences in a particular cell. The assays are further described below in the 

20 example. PCR techniques can be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the ovarian cancer 
protein are detected. Although DNA or RNA encoding the ovarian cancer protein 
may be detected, of particular interest are methods wherein an mRNA encoding a 
ovarian cancer protein is detected. Probes to detect mRNA can be a 

25 nucleotide/deoxynucleotide probe that is complementary to and hybridizes with 
the mRNA and includes, but is not limited to, oligonucleotides, cDNA or RNA. 
Probes also should contain a detectable label, as defined herein. In one method 
the mRNA is detected after immobilizing the nucleic acid to be examined on a 
solid support such as nylon membranes and hybridizing the probe with the 

30 sample. Following washing to remove the non-specifically bound probe, the label 
is detected. In another method detection of the mRNA is performed in situ. In this 
method permeabilized cells or tissue samples are contacted with a detectably 
labeled nucleic acid probe for sufficient time to allow the probe to hybridize with 
the target mRNA. Following washing to remove the non-specifically bound probe, 

35 the label is detected. For example a digoxygenin labeled riboprobe (RNA probe) 
that is complementary to the mRNA encoding a ovarian cancer protein is 
detected by binding the digoxygenin with an anti-digoxygenin secondary antibody 
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and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3indoyl 
phosphate. 

In a preferred embodiment, various proteins from the three classes of 
proteins as described herein (secreted, transmembrane or intracellular proteins) 
5 are used in diagnostic assays. The ovarian cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing ovarian cancer sequences are used 
in diagnostic assays. This can be performed on an individual gene or 
corresponding polypeptide level. In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening 
10 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, ovarian cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of ovarian 
cancer. Detection of these proteins in putative ovarian cancer tissue allows for 

is detection or diagnosis of ovarian cancer. In one embodiment, antibodies are used 
to detect ovarian cancer proteins. A preferred method separates proteins from a 
sample by electrophoresis on a gel (typically a denaturing and reducing protein 
gel, but may be another type of gel, including isoelectric focusing gels and the 
like). Following separation of proteins, the ovarian cancer protein is detected, 

20 e.g., by immunoblotting with antibodies raised against the ovarian cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art 

In another preferred method, antibodies to the ovarian cancer protein find 
use in in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: 
Antibodies in Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are 

25 contacted with from one to many antibodies to the ovarian cancer protein(s). 
Following washing to remove non-specific antibody binding, the presence of the 
antibody or antibodies is detected. In one embodiment the antibody is detected by 
incubating with a secondary antibody that contains a detectable label. In another 
method the primary antibody to the ovarian cancer proteins) contains a detectable 

30 label, e.g. an enzyme marker that can act on a substrate. In another preferred 
embodiment each one of multiple primary antibodies contains a distinct and 
detectable label. This method finds particular use in simultaneous screening for a 
plurality of ovarian cancer proteins. As will be appreciated by one of ordinary skill 
in the art, many other histological imaging techniques are also provided by the 

35 invention. 

In a preferred embodiment the label is detected in a fluorometer which has 
the ability to detect and distinguish emissions of different wavelengths. In 
addition, a fluorescence activated cell sorter (FACS) can be used in the method. 
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In another preferred embodiment, antibodies find use in diagnosing ovarian 
cancer from blood, serum, plasma, stool, and other samples. Such samples, 
therefore, are useful as samples to be probed or tested for the presence of 
ovarian cancer proteins. Antibodies can be used to detect a ovarian cancer 
5 protein by previously described immunoassay techniques including ELISA, 
immunoblotting (western blotting), immunoprecipitation, BIACORE technology 
and the like. Conversely, the presence of antibodies may indicate an immune 
response against an endogenous ovarian cancer protein. 

In a preferred embodiment, in situ hybridization of labeled ovarian cancer 
10 nucleic acid probes to tissue arrays is done. For example, arrays of tissue 
samples, including ovarian cancer tissue and/or normal tissue, are made. In situ 
hybridization (see, e.g., Ausubel, supra) is then performed. When comparing the 
fingerprints between an individual and a standard, the skilled artisan can make a 
diagnosis, a prognosis, or a prediction based on the findings. It is further 
15 understood that the genes which indicate the diagnosis may differ from those 
which indicate the prognosis and molecular profiling of the condition of the cells 
may lead to distinctions between responsive or refractory conditions or may be 
predictive of outcomes. 

In a preferred embodiment, the ovarian cancer proteins, antibodies, nucleic 
20 acids, modified proteins and cells containing ovarian cancer sequences are used 
in prognosis assays. As above, gene expression profiles can be generated that 
correlate to ovarian cancer, in terms of long term prognosis. Again, this may be 
done on either a protein or gene level, with the use of genes being preferred. As 
above, ovarian cancer probes may be attached to biochips for the detection and 
25 quantification of ovarian cancer sequences in a tissue or patient. The assays 
proceed as outlined above for diagnosis. PCR method may provide more 
sensitive and accurate quantification. 

Preferred gene sequences, whose expression levels are to be determined 
as a means of diagnosing ovarian cancer, or a predisposition to ovarian cancer 
30 and/or providing long term prognosis are those referred to in the Tables that are 
listed in one or more of the Tables as having a P value of less than 0.05, more 
preferably a P value of less than 0.01 . Similarly, preferred sequences are those 
referred to in Table 3B as having a ratio of expression between ovarian patients 
and normal patients of at least 5, more preferably at least 6, 7, 8, 9 or 10, and 
35 those sequences referred to in Table 3B as having a ratio of expression between 
ovarian patients and normal patients of less than -5 or -10. 

The P values and ratios provided in the Tables for the various ovarian 
cancer-related sequences, including the survival, clinical recurrence and 
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biochemical recurrence P values in Tables 6, 7 and 8 provide detailed guidance 
to a person skilled in the art seeking to obtain a diagnosis/prognosis. 

Assays for therapeutic compounds 

5 The proteins, nucleic acids, and antibodies as described herein can be 

used in drug screening assays to identify candidate compounds for use in treating 
ovarian cancer. The ovarian cancer proteins, antibodies, nucleic acids, modified 
proteins and cells containing ovarian cancer sequences are used in drug 
screening assays or by evaluating the effect of drug candidates on a "gene 

10 expression profile" or expression profile of polypeptides. In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high 
throughput screening techniques to allow monitoring for expression profile genes 
after treatment with a candidate agent (e.g., Zlokarnik, et al., 1998, Science 279: 
84-88); Heid, 1996, Genome Res 6: 986-94). 

15 ,n a preferred embodiment, the ovarian cancer proteins, antibodies, nucleic 

acids, modified proteins and cells containing the native or modified ovarian 
cancer proteins are used in screening assays. That is, the present invention 
provides methods for screening for compounds/agents which modulate the 
ovarian cancer phenotype or an identified physiological function of a ovarian 

20 cancer protein. As above, this can be done on an individual gene level or by 
evaluating the effect of drug candidates on a "gene expression profile". In a 
preferred embodiment, the expression profiles are used, preferably in conjunction 
with high throughput screening techniques to allow monitoring for expression 
profile genes after treatment with a candidate agent, see Zlokarnik, supra. 

25 Having identified the differentially expressed genes herein, a variety of 

assays may be executed. In a preferred embodiment, assays may be run on an 
individual gene or protein level. That is, having identified a particular gene as up 
regulated in ovarian cancer, test compounds can be screened for the ability to 
modulate gene expression or for binding to the ovarian cancer protein. 

30 "Modulation" thus includes both an increase and a decrease in gene expression. 
The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing ovarian cancer, with 
changes of at least 10%, preferably 50%, more preferably 100-300%, and in 
some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold 

35 increase in ovarian cancer tissue compared to normal tissue, a decrease of about 
four-fold is often desired; similarly, a 10-fold decrease in ovarian cancer tissue 
compared to normal tissue often provides a target value of a 10-fold increase in 
expression to be induced by the test compound. 
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The amount of gene expression may be monitored using nucleic acid 
probes and the quantification of gene expression levels, or, alternatively, the gene 
product itself can be monitored, e.g., through the use of antibodies to the ovarian 
cancer protein and standard immunoassays. Proteomics and separation 

5 techniques may also allow quantification of expression. . 

In a preferred embodiment, gene expression or protein monitoring of a 
number of entities, i.e., an expression profile, is monitored simultaneously. Such 
profiles will typically involve a plurality of those entities described herein. 

In this embodiment, the ovarian cancer nucleic acid probes are attached to 

10 biochips as outlined herein for the detection and quantification of ovarian cancer 
sequences in a particular cell. Alternatively, PCR may be used. Thus, a series 
may be used with dispensed primers in desired wells. A PCR reaction can then 
be performed and analyzed for each well. 

Expression monitoring can be performed to identify compounds that modify 

is the expression of one or more ovarian cancer-associated sequences, e.g., a 
polynucleotide sequence set out in the Tables. In a preferred embodiment, a test 
modulator is added to the cells prior to analysis. Moreover, screens are also 
provided to identify agents that modulate ovarian cancer, modulate ovarian 
cancer proteins, bind to a ovarian cancer protein, or interfere with the binding of a 

20 ovarian cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or 
grammatical equivalents as used herein describes any molecule, e.g., protein, 
oligopeptide, small organic molecule, polysaccharide, polynucleotide, etc., to be 
tested for the capacity to directly or indirectly alter the ovarian cancer phenotype 

25 or the expression of a ovarian cancer sequence, e.g., a nucleic acid or protein 
sequence. In preferred embodiments, modulators alter expression profiles, or 
expression profile nucleic acids or proteins provided herein. In one embodiment, 
the modulator suppresses a ovarian cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induced a ovarian cancer 

30 phenotype. Generally, a plurality of assay mixtures are run in parallel with 
different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative 
control, i.e., at zero concentration or below the level of detection. 

Drug candidates encompass numerous chemical classes, though typically 

35 they are organic molecules, preferably small organic compounds having a 
molecular weight of more than 100 and less than about 2,500 daltons. Preferred 
small molecules are less than 2000, or less than 1500 or less than 1000 or less 
than 500 Daltons. Candidate agents comprise functional groups necessary for 
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structural interaction with proteins, particularly hydrogen bonding, and typically 
include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least 
two of the functional chemical groups. The candidate agents often comprise 
cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
5 structures substituted with one or more of the above functional groups. Candidate 
agents are also found among biomolecules including peptides, saccharides, fatty 
acids, steroids, purines, pyrimidines, derivatives, structural analogs or 
combinations thereof. Particularly preferred are peptides. 

In one aspect, a modulator will neutralize the effect of a ovarian cancer 

10 protein. By "neutralize" is meant that activity of a protein is inhibited or blocked 
and the consequent effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will 
be screened for an ability to bind to a ovarian cancer polypeptide or to modulate 
activity. Conventionally, new chemical entities with useful properties are 

15 generated by identifying a chemical compound (called a "lead compound") with 
some desirable property or activity, e.g., inhibiting activity, creating variants of the 
lead compound, and evaluating the property and activity of those variant 
compounds. Often, high throughput screening (HTS) methods are employed for 
such an analysis. 

20 In one preferred embodiment, high throughput screening methods involve 

providing a library containing a large number of potential therapeutic compounds 
(candidate compounds). Such "combinatorial chemical libraries" are then 
screened in one or more assays to identify those library members (particular 
chemical species or subclasses) that display a desired characteristic activity. The 

25 compounds thus identified can serve as conventional "lead compounds" or can 
themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by 
combining a number of chemical "building blocks" such as reagents. For 

30 example, a linear combinatorial chemical library, such as a polypeptide (e.g., 
mutein) library, is formed by combining a set of chemical building blocks called 
amino acids in every possible way for a given compound length (i.e., the number 
of amino acids in a polypeptide compound). Millions of chemical compounds can 
be synthesized through such combinatorial mixing of chemical building blocks 

35 (Gallop etai, 1994, J. Med. Chem. 37(9): 1233-1 251). 

Preparation and screening of combinatorial chemical libraries is well known 
to those of skill in the art. Such combinatorial chemical libraries include, but are 
not limited to, peptide libraries, peptoids, encoded peptides, random 
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bio-oligomers, nonpeptidal peptidomimetics, analogous organic syntheses of 
small compound libraries, nucleic acid libraries, peptide nucleic acid libraries, 
antibody libraries, carbohydrate libraries and small organic molecule libraries. 

The assays to identify modulators are amenable to high throughput 

5 screening. Preferred assays thus detect enhancement or inhibition of ovarian 
cancer gene transcription, inhibition or enhancement of polypeptide expression, 
and inhibition or enhancement of polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those 

10 of skill in the art. Similarly, binding assays and reporter gene assays are similarly 
well known. Thus, e.g., U.S. Patent No. 5,559,410 discloses high throughput 
screening methods for proteins, U.S. Patent No. 5,585,639 discloses high 
throughput screening methods for nucleic acid binding (i.e., in arrays), while U.S. 
Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of 

15 screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; 
Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, 
efc.;. These systems typically automate entire procedures, including all samlsle 

20 and reagent pipetting, liquid dispensing, timed incubations, and final readings of 
the microplate in detectors) appropriate for the assay. These configurable 
systems provide high throughput and rapid start up as well as a high degree of 
flexibility and customization. The manufacturers of such systems provide detailed 
protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 

25 technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts 
containing proteins, or random or directed digests of proteinaceous cellular 

30 extracts, may be used. In this way libraries of proteins may be made for screening 
in the methods of the invention. Particularly preferred in this embodiment are 
libraries of bacterial, fungal, viral, and mammalian proteins, with the latter being 
preferred, and human proteins being especially preferred. Particularly useful test 
compound will be directed to the class of proteins to which the target belongs, 

35 e.g., substrates for enzymes or Hgands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to 
about 30 amino acids, with from about 5 to about 20 amino acids being preferred, 
and from about 7 to about 15 being particularly preferred. The peptides may be 
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digests of naturally occurring proteins as is outlined above, random peptides, or 
"biased" random peptides. By "randomized" or grammatical equivalents herein is 
meant that each nucleic acid and peptide consists of essentially random 
nucleotides and amino acids, respectively. Since generally these random 
5 peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic 
process can be designed to generate randomized proteins or nucleic acids, to 
allow the formation of all or most of the possible combinations over the length of 
the sequence, thus forming a library of randomized candidate bioactive 
10 proteinaceous agents. 

In one embodiment, the library is .fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is 
biased. That is, some positions within the sequence are either held constant, or 
are selected from a limited number of possibilities. For example, in a preferred 

is embodiment, the nucleotides or amino acid residues are randomized within a 
defined class, e.g., of hydrophobic amino acids, hydrophilic residues, sterically 
biased (either small or large) residues, towards the creation of nucleic acid 
binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, 

20 etc., or to purines, etc. 

Modulators of ovarian cancer can also be nucleic acids, as defined below. 
As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random 
nucleic acids. For example, digests of procaryotic or eucaryotic genomes may be 
25 used as is outlined above for proteins. 

In certain embodiments, the activity of a ovarian cancer-associated protein 
is down-regulated, or entirely inhibited, by the use of antisense polynucleotide, 
i.e., a nucleic acid complementary to, and which can preferably hybridize 
specifically to, a coding mRNA nucleic acid sequence, e.g., a ovarian cancer 
protein mRNA, or a subsequence thereof. Binding of the antisense polynucleotide 
to the mRNA reduces the translation and/or stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from 
naturally-occurring subunits or their close homologs. Antisense polynucleotides 
may also have altered sugar moieties or inter-sugar linkages. Exemplary among 
these are the phosphorothioate and other sulfur containing species which are 
known for use in the art. Analogs are comprehended by this invention so long as 
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they function effectively to hybridize with the ovarian cancer protein mRNA. See, 
e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using 
recombinant means, or can be synthesized in vitro. Equipment for such synthesis 
5 is sold by several vendors, including Applied Biosystems. The preparation of 
other oligonucleotides such as phosphorothioates and alkylated derivatives is 
also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block 
10 transcription by binding to the anti-sense strand. The antisense and sense 
oligonucleotide comprise a single-stranded nucleic acid sequence (either RNA or 
DNA) capable of binding to target mRNA (sense) or DNA .(antisense) sequences 
for ovarian cancer molecules. Antisense or sense oligonucleotides, according to 
the present invention, comprise a fragment generally at least about 14 
15 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a 
given protein is described, in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 
and van der Krol et al. (BioTechniques 6:958 (1 988)). 

In addition to antisense polynucleotides, ribozymes can be used to target 
20 and inhibit transcription of ovarian cancer-associated nucleotide sequences. A 
ribozyme is an RNA molecule that catalytically cleaves other RNA- molecules. 
Different kinds of ribozymes have been described, including group I ribozymes 
hammerhead ribozymes, hairpin ribozymes, RNase P, and axhead ribozymes 
(see, e.g., Castanotto et al, Adv. in Pharmacology 25: 289-317 (1994) for a 
25 general review of the properties of different 5 ribozymes). 

Methods of preparing ribozymes are well known to those of skill in the art 
(see, e.g., WO 94/26877; Ojwang era/., Proc. Natl. Acad. Sci. USA 90:6340-6344 
(1993); Yamada et al., Human Gene Therapy 1:39-45 (1994)- Leavitt et al Proc 
Natl. Acad. Sci. USA 92:699- 703 (1995); Leavitt et al., Human Gene Therapy 
30 5:1151-120 ( 1994); and Yamada et al., Virology 205: 121-126 (1994)). 

Polynucleotide modulators of ovarian cancer may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a 
ligand binding molecule, as described in WO 91/04753. Suitable ligand binding 
molecules include, but are not limited to, cell surface receptors, growth factors 
other cytokines, or other ligands that bind to cell surface receptors. Preferably' 
conjugation of the ligand binding molecule does not substantially interfere with the 
ability of the ligand binding molecule to bind to its corresponding molecule or 
receptor, or block entry of the sense or antisense oligonucleotide or its 
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conjugated version into the cell. Alternatively, a polynucleotide modulator of 
ovarian cancer may be introduced into a cell containing the target nucleic acid 
sequence, e.g., by formation of an polynucleotide-lipid complex, as described in 
WO 90/10448. It is understood that the use of antisense molecules or knock out 
and knock in models may also be used in screening assays as discussed above, 
in addition to methods of treatment. 

As noted above, gene expression monitoring is conveniently used to test 
candidate modulators (e.g., protein, nucleic acid or small molecule). After the 
candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added 
to the biochip. If required, the target sequence is prepared using known 
techniques. For example, the sample may be treated to lyse the cells, using 
known lysis buffers, electroporation, etc., with purification and/or amplification 
such as PCR performed as appropriate. For example, an in vitro transcription with 
labels covalently attached to the nucleotides is performed. Generally, the nucleic 
acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a 
means of detecting the target sequence's specific binding to a probe. The label 
also can be an enzyme, such as, alkaline phosphatase or horseradish 
peroxidase, which when provided with an appropriate substrate produces a 
product that can be detected. Alternatively, the label can be a labeled compound 
or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 
altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of 
biotin, the streptavidin is labeled as described above, thereby, providing a 
detectable signal for the bound target sequence. Unbound labeled streptavidin is 
typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of 
multiple probes, as is generally outlined in U.S. Patent Nos. 5 681 702 
5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5 571 Wo" 
5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 ' and 
5,681,697, all of which are hereby incorporated by reference. In this embodiment 
in general, the target nucleic acid is prepared as outlined above, and then added 
to the biochip comprising a plurality of nucleic acid probes, under conditions that 
allow the formation of a hybridization complex. 
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A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The 
assays are generally run under stringency conditions which allows formation of 
the label probe hybridization complex only in the presence of target. Stringency 
5 can be controlled by altering a step parameter that is a thermodynamic variable, 
including, but not limited to, temperature, formamide concentration, salt 
concentration, chaotropic salt concentration pH, organic solvent concentration, 
etc. 

These parameters may also be used to control non-specific binding, as is 
10 generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to 
perform certain steps at higher stringency conditions to reduce non-specific 
binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in 

15 different orders, with preferred embodiments outlined below. In addition, the 
reaction may include a variety of other reagents. These include salts, buffers, 
neutral proteins, e.g. albumin, detergents, etc. which may be used to facilitate 
optimal hybridization and detection, and/or reduce non-specific or background 
interactions. Reagents that otherwise improve the efficiency of the assay, such as 

20 protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of 
the target. 

The assay data are analyzed to determine the expression levels, and 
changes in expression levels as between states, of individual genes, forming a 

25 gene expression profile. 

Screens are performed to identify modulators of the ovarian cancer 
phenotype. In one embodiment, screening is performed to identify modulators 
that can induce or suppress a particular expression profile, thus preferably 
generating the associated phenotype. In another embodiment, e.g., for diagnostic 

30 applications, having identified differentially expressed genes important in a 
particular state, screens can be performed to identify modulators that alter 
expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression 
product of a differentially expressed gene. Again, having identified the importance 

35 of a gene in a particular state, screens are performed to identify agents that bind 
and/or modulate the biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to 
a candidate agent. After identifying a modulator based upon its ability to suppress 
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a ovarian cancer expression pattern leading to a norma! expression pattern, or to 
modulate a single ovarian cancer gene expression, profile so as to mimic the 
expression of the gene from normal tissue, a screen as described above can be 
performed to identify genes that are specifically modulated in response to the 
5 agent. Comparing expression profiles between normal tissue and agent treated 
ovarian cancer tissue reveals genes that are not expressed in normal tissue or 
ovarian cancer tissue, but are expressed in agent treated tissue. These agent- 
specific sequences can be identified and used by methods described herein for 
ovarian cancer genes or proteins. In particular these sequences and the proteins 

10 they encode find use in marking or identifying agent treated cells. In addition, 
antibodies can be raised against the agent induced proteins and used to target 
novel therapeutics to the treated ovarian cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population 
of ovarian cancer cells, that have an associated ovarian cancer expression 

15 profile. By "administration" or "contacting" herein is meant that the candidate 
agent is added to the cells in such a manner as to allow the agent to act upon the 
cell, whether by uptake and intracellular action, or by action at the cell surface. In 
some embodiments, nucleic acid encoding a proteinaceous candidate agent (i.e., 
a peptide) may be put into a viral construct such as an adenoviral or retroviral 

20 construct, and added to the cell, such that expression of the peptide agent is 
accomplished. Regulatable gene administration systems can also be used. 

Once the test compound has been administered to the cells, the cells can 
be washed if desired and are allowed to incubate under preferably physiological 
conditions for some period of time. The cells are then harvested and a new gene 

25 expression profile is generated, as outlined herein. 

Thus, e.g., ovarian cancer tissue may be screened for agents that 
modulate, e.g., induce or suppress the ovarian cancer phenotype. A change in at 
least one gene, preferably many, of the expression profile indicates that the agent 
has an effect on ovarian cancer activity. By defining such a signature for the 
• 30 ovarian cancer phenotype, screens for new drugs that alter the phenotype can be 
devised. With this approach, the drug target need not be known and need not be 
represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 

35 individual genes and gene products (proteins). That is, having identified a 
particular differentially expressed gene as important in a particular state, 
screening of modulators of either the expression of the gene or the gene product 
itself can be done. The gene products of differentially expressed genes are 
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sometimes referred to herein as "ovarian cancer proteins" or a "ovarian cancer 
modulatory protein". The ovarian cancer modulatory protein may be a fragment, 
or alternatively, be the full length protein to the fragment encoded by the nucleic * 
acids referred to in the Tables. Preferably, the ovarian cancer modulatory protein 
is a fragment. In a preferred embodiment, the ovarian cancer amino acid 
sequence which is used to determine sequence identity or similarity is encoded 
by a nucleic acid referred to in the Tables. In another embodiment, the sequences ■ 
are naturally occurring allelic variants of a protein encoded by a nucleic acid 
referred to in the Tables. In another embodiment, the sequences are sequence 
variants as further described herein. 

Preferably, the ovarian cancer modulatory protein is a fragment of 
approximately 14 to 24 amino acids long. More preferably the fragment is a 
soluble fragment. Preferably, the fragment includes a non-transmembrane region. 
In a preferred embodiment, the fragment has an N-terrninal Cys to aid in 
solubility. In one embodiment, the C-terminus of the fragment is kept as a free 
acid and the N-terminus is a free amine to aid in coupling, i.e., to cysteine. 

In one embodiment the ovarian cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the ovarian cancer 
protein is conjugated to BSA. 

Measurements of ovarian cancer polypeptide activity, or of ovarian cancer 
or the ovarian cancer phenotype can be performed using a variety of assays. For 
example, the effects of the test compounds upon the function of the ovarian 
cancer polypeptides can be measured by examining parameters described 
above. A suitable physiological change that affects activity can be used to assess 
the influence of a test compound on the polypeptides of this invention. When the 
functional consequences are determined using intact cells or animals, one can 
also measure a variety of effects such as, in the case of ovarian cancer 
associated with tumours, tumour growth, tumour metastasis, neovascularization, 
hormone release, transcriptional changes to both known and uncharacterized 
genetic markers (e.g., northern blots), changes in cell metabolism such as cell 
growth or pH changes, and changes in intracellular second messengers such as 
cGMP. In tire assays of the invention, mammalian ovarian cancer polypeptide is 
typically used, e.g., mouse, preferably human. 

Assays to identity compounds with modulating activity can be performed in 
vitro. For example, a ovarian cancer polypeptide is first contacted with a potential 
modulator and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. 
In one embodiment, the ovarian cancer polypeptide levels are determined in vitro 
by measuring the level of protein or mRNA. The level of protein is measured 
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using immunoassays such as western blotting, ELISA and the like with an 
antibody that selectively binds to the ovarian cancer polypeptide or a fragment 
thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, 
5 are preferred. The level of protein or mRNA is detected using directly or indirectly 
labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids, 
radioactively or enzymatically labeled antibodies, and the like, as described 
herein. 

Alternatively, a reporter gene system can be devised using the ovarian 
10 cancer protein promoter operably linked to a reporter gene such as luciferase, 
green fluorescent protein, CAT, or (beta-gal. The reporter construct is typically 
transfected into a cell. After treatment with a potential modulator, the amount of 
reporter gene transcription, translation, or activity is measured according to 
standard techniques known to those of skill in the art. 
is In a preferred embodiment, as outlined above, screens may be done on 

individual genes and gene products (proteins). That is, having identified a 
particular differentially expressed gene as important in a particular state, 
screening of modulators of the expression of the gene or the gene product itself 
can be done. The gene products of differentially expressed genes are sometimes 
20 referred to herein as "ovarian cancer proteins.' 1 The ovarian cancer protein may 
be a fragment, or alternatively, be the full length protein to a fragment shown 
herein. 

In one embodiment, screening for modulators of expression of specific 
genes is performed. Typically, the expression of only one or a few genes are 

25 evaluated. In another embodiment, screens are designed to first find compounds 
that bind to differentially expressed proteins. These compounds are then 
evaluated for the ability to modulate differentially expressed activity. Moreover, 
once initial candidate compounds are identified, variants can be further screened 
to better evaluate structure activity relationships. 

30 In a preferred embodiment, binding assays are done. In general, purified or 

isolated gene product is used; that is, the gene products of one or more 
differentially expressed nucleic acids are made. For example, antibodies are 
generated to the protein gene products, and standard immunoassays are run to 
determine the amount of protein present. Alternatively, cells comprising the 

35 ovarian cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a 
ovarian cancer protein and a candidate compound, and determining the binding 
of the compound to the ovarian cancer protein. Preferred embodiments utilize the 
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human ovarian cancer protein, although other mammalian proteins may also be 
used, e.g. for the development of animal models of human disease. In some 
embodiments, as outlined herein, variant or derivative ovarian cancer proteins 
may be used. 

5 Generally, in a preferred embodiment of the methods herein, the ovarian 

cancer protein or the candidate agent is non-diffusably bound to an insoluble 
support having isolated sample receiving areas (e.g. a microtiter plate, an array, 
etc.). The insoluble supports may be made of any composition to which the 
compositions can be bound, is readily separated from soluble material, and is 

10 otherwise compatible with the overall method of screening. The surface of such 
supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and 
beads. These are typically made of glass, plastic (e.g., polystyrene), 
polysaccharides, nylon or nitrocellulose, teflon™, etc. microtitre plates and arrays 

15 are especially convenient because a large number of assays can be carried out 
simultaneously, using small amounts of reagents and samples. The particular 
manner of binding of the composition is not crucial so long as it is compatible with 
the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of 

20 antibodies (which do not sterically block either the ligand binding site or activation 
sequence when the protein is bound to the support), direct binding to "stick/' or 
ionic supports, chemical crosslinking, the synthesis of the protein or agent on the 
surface, etc. Following binding of the protein or agent, excess unbound material is 
removed by washing. The sample receiving areas may then be blocked through 

25 incubation with bovine serum albumin (BSA), casein or other innocuous protein or 
other moiety. 

In a preferred embodiment, the ovarian cancer protein is bound to the 
support, and a test compound is added to the assay. Alternatively, the candidate 
agent is bound to the support and the ovarian cancer protein is added. Novel 

30 binding agents include specific antibodies, non-natural binding agents identified in 
screens of chemical libraries, peptide analogs, etc. Of particular interest are 
screening assays for agents that have a low toxicity for human cells. A wide 
variety of assays may be used for this purpose, including labeled in vitro 
protein-protein binding assays, electrophoretic mobility shift assays, 

35 immunoassays for protein binding, functional assays (phosphorylation assays, 
etc.) and the like. 

The determination of the binding of the test modulating compound to the 
ovarian cancer protein may be done in a number of ways. In a preferred 
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embodiment, the compound is labeled, and binding determined directly, e.g., by 
attaching all or a portion of the ovarian cancer protein to a solid support, adding a 
labeled candidate agent (e.g., a fluorescent label), washing off excess reagent, 
and determining whether the label is present on the solid support. Various 

5 blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, 
more than one component can be labeled with different labels, e.g., 125 l for the 
proteins and a fluorophor for the compound. Proximity reagents, e.g., quenching 

10 or energy transfer reagents are also useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to 
the target molecule (i.e., a ovarian cancer protein), such as an antibody, peptide, 
binding partner, ligand, etc. Under certain circumstances, there may be 

15 competitive binding between the compound and the binding moiety, with the 
binding moiety displacing the compound. In one embodiment, the test compound 
is labeled. Either the compound, or the competitor, or both, is added first to the 
protein for a time sufficient to allow binding, if present Incubations may be 
performed at a temperature which facilitates optimal activity, typically between 4 

20 and 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high 
throughput screening. Typically between 0.1 and 1 hour will be sufficient. Excess 
reagent is generally removed or washed away. The second component is then 
added, and the presence or absence of the labeled component is followed, to 
indicate binding. 

25 In a preferred embodiment, the competitor is added first, followed by the 

test compound. Displacement of the competitor is an indication that the test 
compound is binding to the ovarian cancer protein and thus is capable of binding 
to, and potentially modulating, the activity of the ovarian cancer protein. In this 
embodiment, either component can be labeled. Thus, e.g., if the competitor is 

30 labeled, the presence of label in the wash solution indicates displacement by the 
agent. Alternatively, if the test compound is labeled, the presence of the label on 
the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by 

35 the competitor may indicate that the test compound is bound to the ovarian 
cancer protein with a higher affinity. Thus, if the test compound is labeled, the 
presence of the label on the support, coupled with a lack of competitor binding, 
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may indicate that the test compound is capable of binding to the ovarian cancer 
protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activity of the ovarian cancer 
prote.ns. In this embodiment, the methods comprise combining a ovarian cancer 
protein and a competitor in a first sample. A second sample comprises a test 
compound, a ovarian cancer protein, and a competitor. The binding of the 
competitor is determined for both samples, and a change, or difference in binding 
between the two samples indicates the presence of an agent capable of binding 
to the ovarian cancer protein and potentially modulating its activity. That is if the 
binding of the competitor is different in the second sample relative to the first 
sample, the agent is capable of binding to the ovarian cancer protein 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native ovarian cancer protein, but cannot bind to modified ovarian 
cancer proteins. The structure of the ovarian cancer protein may be modeled and 
used in rational drug design to synthesize agents that interact with that site Drug 
cand,dates that affect the activity of a ovarian cancer protein are also identified by 
screening drugs for the ability to either enhance or reduce the activity of the 
protein. 

Positive controls and negative controls may be used in the assays 
Preferably control and test samples are performed in at least triplicate to obtain 
statistically significant results. Incubation of all samples is for a time sufficient for 
he binding of the agent to the protein. Following incubation, samples are washed 
free of non-specificaily bound material and the amount of bound, generally 
labeled agent determined. For example, where a radiolabel is employed the 
samples may be counted in a scintillation counter to determine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These 
.nclude reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which 
may be used to facilitate optimal protein-protein binding and/or reduce 
non-specific or background interactions. Also reagents that otherwise improve the 
efficiency of the assay, such as protease inhibitors, nuclease inhibitors, 
anti-microb.al agents, etc., may be used. The mixture of components may be 
added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening 
for a compound capable of modulating the activity of a ovarian cancer protein 
The methods comprise adding a test compound, as defined above, to a cell 
composing ovarian cancer proteins. Preferred cell types include almost any cell 
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The cells contain a recombinant nucleic acid that encodes a ovarian cancer 
protein. In a preferred embodiment, a library of candidate agents are tested on a 
plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or 

5 previous or subsequent exposure of physiological signals, e.g. hormones, 
antibodies, peptides, antigens, cytokines, growth factors, action potentials, 
pharmacological agents including chemotherapeutics, radiation, carcinogenics, or 
other cells (i.e. cell-cell contacts). In another example, the determinations are 
determined at different stages of the cell cycle process. 

10 In this way, compounds that modulate ovarian cancer agents are identified. 

Compounds with pharmacological activity are able to enhance or interfere with 
the activity of the ovarian cancer protein. Once identified, similar structures are 
evaluated to identify critical structural feature of the compound. 

In one embodiment, a method of inhibiting ovarian cancer cell division is 

15 provided. The method comprises administration of a ovarian cancer inhibitor. In 
another embodiment, a method of inhibiting ovarian cancer is provided. The 
method comprises administration of a ovarian cancer inhibitor. In a further 
embodiment, methods of treating cells or individuals with ovarian cancer are 
provided. The method comprises administration of a ovarian cancer inhibitor. 

20 In one embodiment, a ovarian cancer inhibitor is an antibody as discussed 

above. In another embodiment, the ovarian cancer inhibitor is an antisense 
molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
those of skill in the art, as described below. 

25 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells 
are transformed, they lose this phenotype and grow detached from the substrate. 
For example, transformed cells can grow in stirred suspension culture or 

30 suspended in semi-solid media, such as semi-solid or soft agar. The transformed 
cells, when transfected with tumour suppressor genes, regenerate normal 
phenotype and require a solid substrate to attach and grow. Soft agar growth or 
colony formation in suspension assays can be used to identity modulators of 
ovarian cancer sequences, which when expressed in host cells, inhibit abnormal 

35 cellular proliferation and transformation. A therapeutic compound would reduce or 
eliminate the host cells 1 ability to grow in stirred suspension culture or suspended 
in semisolid media, such as semi-solid or soft. 
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Techniques for soft agar growth or colony formation in suspension assays 
are described in Freshney, Culture of Animal Cells a Manual of Basic Technique 
(3rd ed., 1994), herein incorporated by reference. See also, the methods section 
of Garkavtsev et al. (I996), supra, herein incorporated by reference. 

5 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish 
until they touch other cells. When the cells touch one another, they are contact 
inhibited and stop growing. When cells are transformed, however, the cells are 

io not contact inhibited and continue to grow to high densities in disorganized foci. 
Thus, the transformed cells grow to a higher saturation density than normal cells. 
This can be detected morphologically by the formation of a disoriented monolayer 
of cells or rounded cells in foci within the regular pattern of normal surrounding 
cells. Alternatively, labeling index with ( 3 H)-thymidine at saturation density can be 

15 used to measure density limitation of growth. See Freshney (1994), supra. The 
transformed cells, when transfected with tumour suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower 
density. 

In this assay, labeling index with ( 3 H)4hymidine at saturation density is a 
20 preferred method of measuring density limitation of growth. Transformed host 
cells are transfected with a ovarian cancer-associated sequence and are grown 1 
for 24 hours at saturation density in non-limiting medium conditions. The 
percentage of cells labeling with ( 3 H)-thymidine is determined 
autoradiographically. See, Freshney (1994), supra. 

25 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
counterparts (see, e.g., Temin, J. Natl. Cancer I nsti. 37:167-175 (1966); Eagle ef 
al, J. Exp. Med. 131:836-879 (1970)); Freshney, supra. This is in part due to 

30 release of various growth factors by the transformed cells. Growth factor or serum 
dependence of transformed host cells can be compared with that of control. 
Tumor specific markers levels Tumor cells release an increased amount of 
certain factors (hereinafter "tumour specific markers 11 ) than their normal 
counterparts. For example, plasminogen activator (PA) is released from human 

35 glioma at a higher level than from normal brain cells (see, e.g., Gullino, 
Angiogenesis, tumour vascularization, and potential interference with tumour 
growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). 
Similarly, Tumor angiogenesis factor (TAF) is released at a higher level in tumour 
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cells than their normal counterparts. See, e.g., Folkman, Angiogenesis and 
Cancer, Sem Cancer Biol. (1992)). Various techniques which measure the 
release of these factors are described in Freshney (1994), supra. Also, see, 
Unkless et a/. , J. Biol. Chem. 249:4295-4305 (1974); Strickland & Beers, J. Biol. 
5 Chem. 251:5694-5702 (1976); Whur et af., Br. J. Cancer 42:305 312 (1980); 
Gullino, Angiogenesis, tumour vascularization, and potential interference with 
tumour growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 
1985); Freshney Anticancer Res. 5: 111-130 (1985). 

10 Invasiveness into Matrigei 

The degree of invasiveness into Matrigel-or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate ovarian 
cancer-associated sequences. Tumor cells exhibit a good correlation between 
malignancy and invasiveness of cells into Matrigei or some other extracellular 

15 matrix constituent. In this assay, tumourigenic cells are typically used as host 
cells. Expression of a tumour suppressor gene in these host cells would decrease 
invasiveness of the host cells. 

Techniques described in Freshney (1 994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with 

20 Matrigei or some other extracellular matrix constituent Penetration into the gel, or 
through to the distal side of the filter, is rated as invasiveness, and rated 
histologically by number of cells and distance moved, or by prelabeling the cells 
with 125 1 and counting the radioactivity on the distal side of the filter or bottom of 
the dish. See, e.g., Freshney (1 984), supra. 

25 

Tumor growth in vivo 

Effects of ovarian cancer-associated sequences on cell growth can be 
tested in transgenic or immune-suppressed mice! Knock-out transgenic mice can 
be made, in which the ovarian cancer gene is disrupted or in which a ovarian 

30 cancer gene is inserted. Knock- out transgenic mice can be made by insertion of 
a marker gene or other heterologous gene into the endogenous ovarian cancer 
gene site in the mouse genome via homologous recombination. Such mice can 
also be made by substituting the endogenous ovarian cancer gene with a 
mutated version of the ovarian cancer gene, or by mutating the endogenous 

35 ovarian cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse 
embryo, which is re-implanted into a recipient female. Some of these embryos 
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develop into chimeric mice that possess germ cells partially derived from the 
mutant cell line. Therefore, by breeding the chimeric mice it is possible to obtain a 
new line of mice containing the introduced genetic lesion (see, e.g. Capecchi et 
ai, Science 244:1288 (1989)). Chimeric targeted mice can be derived according 
5 to Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual Cold 
Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach, Robertson, ed., IRL Press, Washington, D.C., 
(1987), 

Alternatively, various immune-suppressed or immune-deficient host 
io animals can be used. For example, genetically athymic "nude" mouse (see, e.g 
Giovanella ef al., J. Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a 
thymectomized mouse, or an irradiated mouse (see, e.g., Bradley et al Br. J 
Cancer 38:263 (1978); Selby et al., Br. J. Cancer 41:52 (1980)) can be used as a 
host. Transplantable tumour cells (typically about 10 6 cells) injected into isogenic 
is hosts will produce invasive tumours in a high proportions of cases, while normal 
cells of similar origin will not. In hosts which developed invasive tumours, cells 
expressing a ovarian cancer-associated sequences are injected subcutaneously 
After a suitable length of time, preferably 4 to 8 weeks, tumour growth is 
measured (e.g. by volume or by its two largest dimensions) and compared to the 
20 control. Tumours that have a statistically significant reduction (using, e.g. 
Student's T test) are said to have inhibited growth. 

Administration 

Proteins and polynucleotides of the invention and substances identified or 
25 identifiable by the assay methods of the invention as modulating the activity of the 
ovanan cancer-associated proteins/polynucleotides can be administered to 
patients, therapeutically. Typically, such proteins/polynucleotides and substances 
may preferably be combined with various components to produce compositions of 
the invention. Preferably the compositions are combined with a pharmaceutically 
30 acceptable carrier or diluent to produce a pharmaceutical composition (which may 
be for human or animal use). Suitable carriers and diluents include isotonic saline 
solutions, for example phosphate-buffered saline. The composition of the 
invention may be administered by direct injection. The composition may be 
formulated for parenteral, intramuscular, intravenous, subcutaneous, intraocular 
35 oral, vaginal or transdermal administration. Typically, each protein may be 
administered at a dose of from 0.01 to 30 mg/kg body weight, preferably from 0.1 
to 10 mg/kg, more preferably from 0.1 to 1 mg/kg body weight 
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Polynucleotides/vectors encoding polypeptide components for use in 
modulating the activity of the ovarian cancer-associated proteins/polynucleotides 
may be administered directly as a naked nucleic acid construct. When the 
polynucleotides/vectors are administered as a naked nucleic acid, the amount of 
5 nucleic acid administered may typically, be in the range of from 1 ug to 10 mg, 
preferably from 100 ug to 1 mg. 

Uptake of naked nucleic acid constructs by mammalian cells is enhanced 
by several known transfection techniques for example those including the use of 
transfection agents. Example of these agents include cationic agents (for 

io example calcium phosphate and DEAE-dextran) and lipofectants (for example 
lipofectam™ and transfectam™). Typically, nucleic acid constructs are mixed with 
the transfection agent to produce a composition. 

Preferably the polynucleotide or vector of the invention is combined with a 
pharmaceutically acceptable carrier or diluent to produce a pharmaceutical 

15 composition. Suitable carriers and diluents include isotonic saline solutions, for 
example phosphate-buffered saline. The composition may be formulated for 
parenteral, intramuscular, intravenous, subcutaneous, oral, intraocular or 
transdermal administration. 

The pharmaceutical compositions can be administered in a range of unit 

20 dosage forms depending on the method of administration. For example, unit 
dosage forms suitable for oral administration include, powder, tablets, ' pills, 
capsules and lozenges. Orally administered dosage forms will typically be 
formulated to protect the active ingredient from digestion and may therefore be 
complexed with appropriate earner molecules and/or packaged in an 

25 appropriately resistant carrier. Suitable carrier molecules and packaging 
materials/barrier materials are known in the art. 

The compositions of the invention can be administered for therapeutic or 
prophylatic treatments. In therapeutic applications, compositions are 
administered to a patient suffering from a disease (e.g. ovarian cancer) in an 

30 amount sufficient to cure or at least partially ameliorate the disease and its 
complications. An amount adequate to accomplish this is defined as a 
-therapeutically effective dose". An amount of the composition that is capable of 
preventing or slowing the development of cancer in a patient is referred to as a 
"prophylactically effective dose". 

35 The routes of administration and dosages described are intended only as a • 

guide since a skilled practitioner will be able to determine readily the optimum 
route of administration and dosage for any particular patient and condition. 
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Experimental - Identification of genes associated with ovarian cancer 
development and outcome using transcript profiles. 

A cohort of 66 primary ovarian cancers and borderline tumours sampled at 
5 the time of primary laparotomy, and 4 normal ovary samples were collected for 
the purpose of transcript profiling. Total RNA was extracted from the fresh-frozen 
specimens with Trizol reagent (Life Technologies, Rockville, MD) and was 
reverse transcribed using a primer containing oligo(dT) and a T7 promoter 
sequence. The resulting cDNAs were then in vitro transcribed in the presence of 

10 biotinylated nucleotides (Bio-1 1-CTP and Bio-16-UTP) using the T7 MEGAscript 
kit (Ambion, Austin, TX). The biotinylated targets were analyzed using a 
customized Affymetrix oligonucleotide microarray representative of over 90% of 
the expressed human genome. Data from these experiments have been mined to 
identify ovarian-specific genes, genes that are overexpressed in ovarian cancer, 

is and genes that cosegregate with disease outcome following primary laporatomy. ' 
These analyses have identified a series of genes that encode overexpressed cell 
surface molecules and are thus therapeutic targets as well as transcription 
factors, protein tyrosine phosphatases and protein kinases, and genes involved in 
cell proliferation and cell adhesion. In addition, we have identified >100 genes 

20 that are associated with a greater risk of ovarian cancer clinical and biochemical 
recurrence and overall survival.. 

The results are presented in the following tables. 

Table 1 - genes whose expression is upregulated in ovarian cancer patients vs 
25 non-ovarian cancer patients. The statistical significance of the upregulation 
(P value) is also given. 

Table 2 - genes whose expression is down regulated in ovarian cancer patients 
vs non-ovarian cancer patients. The statistical significance of the upregulation 
30 (P value) is also given. 

Table 3 - average ratio of expression of specific genes between ovarian cancer 
patients vs non-ovarian cancer patients. The higher the ratio the more highly 
over-expressed the gene is in ovarian cancer patients. Negative ratios indicate 
35 under expression in ovarian cancer patients. 

Table 4 - genes overexpressed/underexpressed In mucinous ovarian cancer 
patients (with P values). 
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Table 5 - genes overexpressed/underexpressed in borderline ovarian cancer 
patients (with P values or ratios). ' 

5 Table 6 - statistical correlation between overexpression/underexpression and 
death of patient. The lower the P value the more likely it is that a patient having 
altered expression of the gene will die from the cancer. 

Table 7 - statistical correlation between overexpression/underexpression and 
10 clinical recurrence. The lower the P value the more likely it is that a patient 
having altered expression of the gene will experience recurrence of the disease. 

Table 8 - statistical correlation between overexpression/underexpression and 
bioclinical recurrence (defined as a clinically significant increase in the levels of 
15 the known biochemical marker CA125). The lower the P value the more likely it is 
that a patient having altered expression of the gene will experience biochemical 
recurrence of the disease. 

All publications mentioned in the above specification are herein 
20 incorporated by reference. Various modifications and variations of the described 
methods and system of the invention will be apparent to those skilled in the art 
without departing from the scope of the invention. Although the invention has 
been described in connection with specific preferred embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such 
25 specific embodiments. Indeed, various modifications of the described modes for 
carrying out the invention which are apparent to those skilled in molecular biology 
or related fields are intended to be within the scope of the invention. 

Dated this fifth day of September 2002 

Garvan Institute of Medical Research 
Patent Attorneys for the Applicant: 
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